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WNT RECEPTOR COMPOSITIONS AND METHODS 



Field of the l^fVENnoN 

The present invention relates to serening methods employing Wm receptors. 

5 
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25 Background of the invention 

Wnt genes encode secreted proteins involved in ceil-to-cell signaling. Wnt genes 
play important growth controlling roles, in particular in the mammary gland, and act as 
oncogenes in mouse mammary tumors. Little is known about the mechanism of action of 
Wnt products, in part because Wnt receptors have until now remained unidentified. 

30 

Summary of the Invention 

In one aspect, the present invention includes an isolated nucleic acid molecule 
encoding a Wnt receptor polypeptide. In a general embodiment, the Wnt receptor 
polypq}tide has an amino acid sequence that is greater than about 90% identical to the 
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amino acid sequence of a Wnt receptor selected from the group consisting of Dfzl, Df22, 
Rfzl, Rfz2. Hfz3. Hfz4, Hfz5, Mfz3, Mfz4, Mfz5, Mfz6, Mfz7, Mfe8, and Cfzl. In a 
related embodiment, the Wnt receptor has an amino acid sequence that is more than about 
90% identical to an amino acid sequence selected from the group consisting of SEQ ID 
5 NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, 
SEQ ID NO: 14 and SEQ ID NO: 16. In another related embodiment, the Wnt receptor 
polypeptide has an amino acid sequence selected from the group consisting of SEQ ID 
NO:2. SEQ ID NO:4. SEQ ID NO:6, SEQ ID NO:8. SEQ ID NO: 10. SEQ ID NO: 12. 
SEQ ID NO: 14 and SEQ ID NO: 16. 
10 Examples of nucleic acid molecules encoding Wnt receptor polypq)tides are 

provided herein as SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:5. SEQ ID NO:7. SEQ ID 
NO:9. SEQ ID NO: 11, SEQ ID NO: 13 and SEQ ID NO: 15. Preferred embodiments are 
human Wnt polynucleotides. An exemplary human Wnt polynucleotide has the sequence 
presented as SEQ ID NO:9. 
15 The invention further includes fragments of polynucleotides encoding full-length 

WntR. where the fragments are of sufficient length to hybridize selectively with a Wnt 
polynucleotide sequence or conq>lement thereof, such as a sequence selected from the group 
consisting of SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:5, SEQ ID NO:7. SEQ ID NO:9. 
SEQ ID NO:ll. SEQ ID NO:13 and SEQ ID NO: 15. Such fragments are at least 15. 

20 preferably at least about 18. 21 or 24. nucleotides in length. 

In another aspect, the invention includes an isolated Wnt receptor polypeptide. In a 
general embodiment, the polypeptide has an amino acid sequence that is more than about 
90% identical to the amino acid sequence of a Wnt receptor selected from the group 
consisting of Dfzl. Dfz2. Rfzl. Rfz2, Hfe3. Hfz4. Hfz5. Mfz3. Mfz4. MfitS. Mfe6, Mf27. 

25 M fe8» and Cfzl . In a related embodiment, the polypeptide has an amino acid sequence that 
is more than about 90% identical to an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6. SEQ ID NO:8, SEQ ID 
NO: 10. SEQ ID NO: 12. SEQ ID NO: 14 and SEQ ID NO: 16. In another related 
embodiment, the polypq)tide sequence is selected from the group consisting of SEQ ID 

30 NO:2, SEQ ID NO:4. SEQ ID NO:6. SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO: 14 and SEQ ID NO: 16. 

Preferred embodiments are human Wnt polypeptides. An exemplary human Wnt 
polypeptide has the sequence presented as SEQ ID NO: 10. 
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The invention further includes peptide fragments derived from a full-length WntR 
polypeptide* where the fragments contain a region of at least seven, preferably at least ten, 
consecutive amino acids, and where the region has at least about an 80% identity with the 
residues of a corresponding region of a polypeptide having a sequence selected from the 
5 group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 
NO:10, SEQ ID NO:12, SEQ ID NO:14 and SEQ ID NO:16, 

Also included in the invention are antibodies, both monoclonal and polyclonal, 
specifically-immunoreactive with Wnt receptor polypeptides. Such antibodies may be 
produced using standard methods (Harlow). 

10 The invention also includes a method of identifying a compound capable of affecting 

binding of a Wnt polypeptide to a Wnt receptor polypeptide. The method includes <i) 
contacting such a Wnt receptor polypeptide with a selected Wnt polypeptide, in the presence 
and absence of a test compound, (ii) measuring the effect of the test compound on the 
extent of binding between the Wnt polypeptide and the Wnt receptor polypeptide, and (iii) 

IS identifying said compound as effective if its measured effect on the extent of binding is 
above a threshold level. In a general embodiment, the method includes an additional step 
(iv) comprising preparing a pharmaceutical preparation of a compound identified as 
effective to alter binding of a Wnt polypeptide to a WntR polypeptide. 

In one embodiment, the threshold is a 2-fold or greater inhibition of binding. In 

20 another embodiment, the threshold is a 2-fold or greater potentiation of binding. Examples 
of suitable Wnt pol)T)eptides include wingless (Wg); examples of suitable Wnt receptor 
polypeptides include Dfz2 (e.g., SEQ ID NO:2). 

The test compound may be effective to inhibit binding between the Wnt polypeptide 
and the Wnt receptor or to displace the Wnt polypeptide from the Wnt receptor polypeptide. 

25 In one embodiment, the Wnt receptor polypeptide is expressed on the surface of a cell (e.g., 
Drosophila Sneider 2 (S2) cell) transformed with an expression vector encoding said 
receptor (e.g., Dfz2). 

In another embodiment, the Wnt receptor polypeptide is an N-terminal portion of a 
full-length Wnt receptor polypeptide, the N-terminal portion including the cysteine-rich 

30 amino-terminal domain. In one embodiment, the N-terminal portion is part of a fusion 
with, e.g., the constant domain of human IgG. 
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25 



These and other objects and features of the invention will become more fully 
apparent when the following detailed description is read in conjunction with the 
accompanying drawings. 

Brief DESCRiFnoN of the Figures 

Figure 1 shows a sequence conq>arison of Dfzl and Dft2. 

Figure 2 shows hydropathy profiles of mammalian and nematode frizzled 
homologues. 

Figure 3 shows a computer-generated image of the expression of DFz2 during 
Drosophila development evaluated by Northern blot. 

Figure 4 is a computer-generated image showing that transfeaion of DFz2 into S2 
cells confers a response to Wg protein. 

Figure S is a computer-generated image made using confocal immunomicroscopy 
showing binding of Wg protein to Dfz-2 transfected cells. 

Figure 6 is a computer-gemrated image showing the binding of metabolically 
labeled Wg protein to a Dfz-2/Ig fusion protein. 

Detailed Description of the Invention 
I. Definitions 

A polynucleotide sequence or fragment is ''derived from" another polynucleotide 
sequence or fragment when it contains the same sequence of nucleotides as are present in 
the sequence or fragment from which it is derived. For example, a bacterial plasmid 
contains an insert "derived from"* a selected human gene if the sequence of the 
polynucleotides in the insert is the same as the sequence of the polynucleotides in the 
selected human gene. 

Similarly, a polypeptide sequence or fragment is "derived from" another polypeptide 
sequence or fragment when it contains the same sequence of amino acids as are present in 
the sequence or fragment from which it is derived. A polypeptide "derived from" a nucleic 
acid is a polypeptide encoded by that nucleic acid. For example, a Wnt receptor 
polypeptide derived from the htmian genome (also termed "human Wnt receptor 
polypeptide" or "hWntR") is a polypeptide encoded by an mRNA (or corresponding cDNA) 
transcribed from a human Wnt receptor gene. 

Percent (%) identity, with respect to two amino acid sequences, refers to the % of 
residues that are identical in the two sequences when the sequences are optimally aligned 
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and no penalty is assigned to '*gaps**. In other words, if a gap needs to be inserted into a 
first sequence to optimally align it with a second sequence, the % identity is calculated 
using only the residues that are paired with a corresponding amino acid residue (i.e., the 
calculation does not consider residues in the second sequences that are in the "gap" of the 
S first sequence). Optimal alignment is defined as the alignment giving the highest % identity 
score. Such alignments can be preformed as described herein using the "GENEWORKS" 
program. Alternatively, alignments may be performed using the local alignment program 
LALIGN with a knip of 1, default parameters and the default PAM. The LALIGN 
program is found in the FAST A version 1.7 suite of sequence conq^arison programs 

10 (Pearson and Lipman, 1988; Pearson, 1990; program available from William R. Pearson, 
Department of Biological Chemistry, Box 440, Jordan Hall, Charlottesville, VA). 

A full-length Wnt recq)tor (WntR) polypeptide is defined herein as a polypeptide 
that is a member of the frizzled protein family, encodes a full-length protein, and has at 
least about a 90% identity with one or more of the following sequences: SEQ ID NO:2, 

15 SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO: 14 and SEQ ID NO: 16. 

II. Overview of the Invention 

The present invention is based on the discovery of a set of novel members of the 

20 vertebrate frizzled family of polarity genes, and on the recognition that the frizzled family 
of polarity genes encodes the receptors for the Wnt family of proteins. The invention is 
further enhanced by the recognition that the full-length sequence of each member of the 
frizzled protein family generally shares a substantially greater degree of homology with the 
full-length sequences of corresponding frizzled proteins in other species (typically about 

25 80% to > 95%) than it does with the full-length sequences of other members of the frizzled 
protein family in the same species (typically about 30% to 60%). Different members of the 
frizzled family, however, do contain regions within the coding sequences that have high 
degrees of homology (up tp 90% or more) with one another. This feature, combined with 
similar sizes and hydrophobicity profiles, facilitates the identification of novel members of 

30 the frizzled gene family. 

Discoveries described herein enable a number of uses and application of the present 
invention. These uses and applications are exemplified and discussed in detail below. 
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III. Identification of Dfz2 as the Wg Receptor 

Experiments performed in support of the present invention and described in 
Examples 1-6, below, indicate that Drosophila frizzled g^rv^ 2 (Dfz2) is a receptor for 
wingless (Wg). Example 1 details the cloning of Dfz2, the sequence of which is illustrated 
5 in Figure 1. Hydrophobicity profiles of additional frizzled family members isolated as part 
of the present invention are shown in Figure 2. Their sequences are presented in the 
Sequence Listing. Example 2 d^cribes in situ hybridization experiments to determine the 
pattern of Dfe2 expression. Example 3 describes Northern analyses (Fig. 3) showing that 
Dfz2 is expressed throughout development. 
10 In Example 4, below, Drosophila Sneider 2 (S2) cells were transformed with a Dfz2 

expression vector and the effects of the Df22 ligand, Wg, were assessed by measuring the 
levels of armadillo (Arm) protein in response to Wg application (Peifer, et aL, 1994; 
Riggleman, et aL, 1990; Van Leeuwen, et aL, 1994). The results, shown in Figure 4. 
demonstrate that all four D^-transfected S2 cell lines tested showed increased armadillo 
IS signal in response to Wg, whereas no such effect was observed with untransfected S2 cells. 
These results demonstrate that DfiQ, acts as a signal transducing molecule for Wg, consistent 
with it being a receptor for Wg. 

Further support is provided by immunohistochemical analyses described in Example 
5. These experiments were designed to address whether Wg was capable of binding to the 
20 Dfz2-tran5fected cells. D£z2-transfected and nontransfected cells were exposed to medium 
containing Wg protein, washed, stained with an anti-Wg antiserum and a labelled secondary 
antibody, and imaged using a confocal microscope. Exemplary images, shown in Figs SA~ 
5F, demonstrate that approximately 80% of Dfz2-transfected S2 cells exposed to Wg protein 
stained brightly (Fig. 5D) whereas Dfz2*transfected cells in the absence of Wg protein (Fig. 
25 5 A) as well as non transfected S2 cells (Fig. SB) did not. The ability of Wg to bind was 
also tested in human 293 cells, which are heterologous to the Dfz2 protein. As shown in 
Fig, 5F, about 10-20% of the transfected cells remained positive, similar to the transfection 
efficiency of 293 cells. Since 293 cells are of human origin, these results indicate that Wg 
binds to Dfz2 itself, rather than to a molecule whose expression is induced by Dfz2. 
30 The binding of Wg protein to Dfz2 was further confirmed using a fusion protein 

containing the cysteine-rich amino-terminal domain of Dfz2, linked to the constant domain 
of human IgG, as described in Example 6. The fusion protein or IgG control was added to 
conditioned medium from normal S2 cells, or S2 cells producing Wg (HS-wg/S2), which 
had been metabolically-labeled with pS] cysteine and methionine. 
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The fusion proteins and possible complexes were then isolated and analyzed by gel 
electrophoresis and fluorography (Fig. 6). Two bands of approximately 52 kd (the size of 
Wg) were detected in the lane with the Dfz2-Ig fusion added to the medium of HS-wg/S2 
cells. 

5 The above results taken together, particularly the observations that (i) Wg binds to 

DFz2, and (ii) the binding leads to a biological response, strongly support the role of Dfz2 
as the receptor for the Wg protein. 



IV. Novel Frizzled Familv Members Identified in Vertebrates 

10 Experiments performed in support of the present invention have further resulted in 

the identification of at least six novel frizzled family members in human and mouse. This 
brings the total number of frizzled-like sequences identified in mammalian genomes to 8, 
since two (R&l and Rfz2) were previously cloned from rat (Chan» et aL, 1992). The six 
novel genes include Mfz3, Mfe4, Mfz7, and M£z8, as well as human sequences 

IS Hfz3, HfeS and Hfz7. A sequence 95% identical over 143 amino acids to HfzS was PCR- 
amplified (MuUis, 1987; MuUis, et aL, 1987) from mouse genomic DNA using Hfz5- 
specific primers, suggesting that an Mfz5 gene exists as well. The DNA and translated 
amino acid sequences of these 6 family members are provided in the Sequence Listing, 
along with the sequence of a novel family member isolated from C. elegans (Cfzl). The 

20 hydrophobicity profiles of these sequences are presented in Figure 2. These profiles, along 
with the sequences of regions that are conserved among different frizzled family members, 
are used in determining whether a polypeptide sequence is a member of the frizzled gene 
family. According to the present invention, member of this family are considered to be 
Wnt receptors. 

25 Using the guidance herein, one of skill in the art can isolate additional members of 

the frizzled gene family. In particular, probes homologous to regions conserved among the 
various family members can be designed and used to probe cDNA or genomic DNA 
libraries. Alternatively or in addition, PGR primers corresponding to such conserved 
regions may be designed and used to isolate additional sequences from any suitable source 

30 of DNA, including libraries and reverse transcription (RT) -generated cDNA samples. 



V. Wnt Genes and Proteins 

Wg in Drosophila is part of larger gene family (Eisenberg, et at., 1992; Graba, et 
aL^ 1995; Russell, et aL, 1992) of Wnt genes. At least 3 homologous genes have been 
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identified in Drosophila^ and over 10 Wnt genes have been identified in most vertebrates 
(Nusse and Varmus, 1992). According to the present invention, the products of these genes 
are the ligands for receptors encoded by the large family of fz-like genes in vertebrates. 
Determination of which Wnt gene products are specific to which Wnt receptor may be 
S performed by one of skill in the art following the teachings of the present specification. 
All members of the V/nt family encode secreted proteins that act as cell-cell 
signaling molecules. Wnt genes play an important role in the control of cell growth, 
particularly in the mammary gland, and can act as oncogenes in mouse mammary tumors. 
The proteins contain a signal sequence, one or several N-linked glycosylation sites and 

10 many cysteine residues. The product of the mouse Wm-l gene has been smdied most 

extensively. If WmA is overexpressed in various cell lines, the protein enters the secretory 
pathway. The protein can be detected in protease resistant structures, presumably secretory 
vesicles, and contains carbohydrate structures at several N-linked glycosylation sites. It is 
thus generally assumed that the Wnt-l protein is secreted from cells, although extracellular 

IS forms of the protein have been difficult to detect. In addition, most of the intracellular 

Wnr-1 protein made in transfected cells is incompletely glycosylated (it remains sensitive to 
endoglycosidase H) and has probably not traversed the Golgi apparatus. Moreover, much 
of the WntA protein becomes associated with the resident ER protein BiP, indicating that it 
is incorrectly folded. 

20 In spite of these difficulties, it has been shown that Wnt-l overproduction leads to 

secretion of modest amounts of extracellular protein. The secreted forms have undergone 
more extensive glycosylations, and may bind to the cell surface or to the extracellular 
matrix. 

25 VI. Role of Wnt in Cancer 

Members of the Wnt gene family are important regulators of mammary cell growth. 
Indeed, Wnt genes owe their discovery to their role as oncogenes in mouse mammary 
cancer: previous experiments which examined the sequence around integration sites for 
Mouse Mammary Tumor Virus (MMTV) DNA showed that many tumors had sustained 

30 proviral insertions near the Wnt-\ gene, the first member of this gene family, A biological 
assay for Wnt-\ was subsequently established using gene transfer experiments. This assay 
was used to show that certain mammary gland-derived cell lines can be morphologically 
transformed by Wnt-l. Direct evidence that Wnt-l expression gives a strong growth 
stimulus to mammary cells came from transgenic mice carrying Wnt-\ linked to the MMTV 
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promoter, which developed mammary hyperplasia and tumors. By infecting primary 
mammary cells with retroviruses expressing Wnt-l and re- implantation of the infected cells, 
similar hyperplasia of the mammary gland were obtained. Additional experiments led to the 
identification of a Wnt-l related oncogene activated by MMTV insertion, called W/tr-3. 
S The growth stimulus generated by the expression of Wnt-l in the mammary gland 

implies that mammary cells are equipped with a Wnt receptor that becomes activated by the 
Wnt'l protein, as well as the other signaling conq>onents. While neither Wnt-l nor Wnt-S 
are expressed in the normal mammary gland, at least S other Wnt genes are expressed 
during specific stages of mammary gland development, including during the rapid expansion 

10 of the pre-lactating gland or when the gland regresses. 

The oncogenic action of WntA and Wnt-S is best explained by their acting as ligands 
for Wnt receptors meant for other Wnt genes, and activating these receptors inappropriately. 
Alternatively, Wnt-l and Wnt-S may not activate these receptors but may interfere with a 
ligand-receptor interaction normally leading to regression of the gland. 

IS The strong growth stimulus by oncogenic Wnt genes and the dynamic expression 

patterns of other Wnt genes in the mammary gland provide evidence that Wnt genes are 
important regulators of mammary gland growth. It is also possible that WNT genes other 
than WNT-\ and WNT-S are involved in human breast cancer. In analogy with the mouse, it 
is likely that some of these are expressed during the normal cycles of growth of the 

20 manunary gland. In contrast to silent genes, genes that are expressed are candidates to 
become amplified, since the ensuing overexpression of those genes can give a selective 
advantage to cells even during the first rounds of amplification. 

By way of illustration, a survey of mouse mammary tumors identified one tumor 
where the mouse Wnt-2 gene was amplified and overexpressed whereas Wnt-2 had a low 

25 level of expression in the normal gland. Further, there was no evidence for insertion of 
MMTV near Wnt-l in that tumor. This finding shows that Wnt genes are not necessarily 
activated only by MMTV, a relevant factor for human breast cancer since that disease has 
no viral etiology but is often characterized by gene amplification. 

30 VII. Screening Methods 

In view of the role of Wnt in cancer and other processes involving growth, 
development and proliferation (both normal and abnormal), it would be desirable to identify 
modulators of Wnt activity that affect the interactions of specific Wnt proteins with their 
receptors. Such modulators may, for example, inhibit the binding of Wnt to its receptor 
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(e.g., by competitive or noncompetitive inhibition) » or they may potentiate or stabilize the 
binding. The recognition that members of the frizzled family of proteins can act as 
receptors for the Wnt family of proteins enables a number of screening approaches to the 
isolation of such modulatory confounds that have heretofore not been possible. 
S Examples of such screening approaches include protein-protein binding assays in 

which the level of binding of Wnt to its receptor, or a biological consequence of such 
binding, is measured. The latter assay is exemplified in Example 4, where cells not 
normally expressing Wnt receptors are transformed with a Wnt receptor (in this case, 
Dfe2), and the effects of Wnt (in this case, Wg) on the cells are measured (in this case, by 
10 detecting levels of Arm). Such cells may be transformed with the Wnt receptor of choice 
(e.g.. any of fzl, fe2, fz3, fz4, fz5, fe6, fz7 or fz8 recqjtors). 

In Example 4, expression of Arm was detected using a Western blot method. Other 
methods may be employed which are more suitable for high throughput screening 
applications. For example, labelled ami* Arm antibodies may be used to directly visualize 
15 levels of Arm in multi-well fomiat screen. 

Alternatively, the assays may simply detect the degree of binding between Wnt 
ligands and Wnt receptors, and not the biological consequences of such binding. For 
example, cells expressing a selected Wnt receptor may be plated in the wells of a 96-wel} 
plate and contacted with a solution containing reporter-labeled Wnt (e.g., radiolabelled of 
20 fluorescently-tagged) in the presence and absence of a test compound (i.e., a puutive 

modulator of Wnt/receptor interactions). The effect of the test compound on the extent of 
binding between Wnt and Wnt receptor is measured, and the compound is identified as 
effective if its effect on the extent of binding is above a threshold level (e.g., a several-fold 
difference in binding level between control and experimental sanq>les) In one embodiment, 
25 the threshold is a 2-fold difference. In another embodiment, it is a 5-fold difference. In 
yet another it is a 10-fold or greater difference. The difference in binding in the presence 
and absence of an effective test compound is preferably statistically-significant^ as 
determined by a standard statistical test. 

It will be appreciated that the putative modulator compound can alternatively be 
30 added after the cells had been incubated with labelled Wnt. In a screen for inhibitors of 
binding, the system is assayed for a decrease in the signal reflecting bound labelled Wnt, or 
an increase in the signal reflecting labelled Wnt in solution. 

Such a screen may also be employed to screen for potentiators of Wnt/recqjtor 
interactions. For example, test compounds may be added to the wells (either during or 



BNSDOCID: <WO 9739357A1 I > 



wo 97/39357 




PCT/US97/06049 



12 



after incubation with labelled Wnt), and the wells then contacted with unlabeled Wnt. Test 
compounds in wells where the unlabelled Wnt is less effective at displacing the bound 
labelled Wnt are selected for more detailed examination of ability to potentiate Wnt/receptor 
binding. 

5 Assays such as described above may also be used to determine the relationship 

between different Wnt proteins and different recq>tors. For exan^le, the ligand 
concentration dependence of binding may be used in measurement of the relative affinities 
of selected Wnt receptors with selected ligands, and ligands with a selected affinity for the 
receptor can be examined further using, e,g., in vitro or in vivo assays. In this manner, 
10 one of skill in the art can identify which Wnt protein(s) is optimally paired with which 
receptor(s). 

In cases where the Wnt ligand has been matched to a specific Wnt receptor (e.g., in 
the case of Wg and Dfz2), the receptor/ligand pair can be used in, e.g., screening 
applications. For example, the pair may be used in a binding assay to screen for 

IS compounds which are effective to modulate the binding of the specific ligand to its receptor. 
These methods enable the identification of compounds with two general types of activities: 
(i) those which act generally, e.g., on a class of WntAVnt receptor pairs, to disrupt or 
facilitate binding, and (ii) those which act selectively disrupt or facilitate the binding 
between a selected Wnt ligand and its receptor, but not between other Wnt ligands and their 

20 receptors. 

Compounds identified by one of the screens described herein may be further 
evaluated for efficacy using an in vitro assay such as described above. Further, such 
compounds may be tested in in vivo models employing WntAVnt receptor interactions. For 
example, the compounds may be tested in a mouse mammary tumor model for effectiveness 
25 at inhibiting growth of manunary tumors. 

VIII. Compounds Suitable for Screening 

A variety of different compounds may be screened using methods of the present 
invention. They include peptides, macromolecules, small molecules, chemical and/or 
30 biological mixtures, and fungal, bacterial, or algal extracts. Such compounds, or mole- 
cules, may be either biological, synthetic organic, or even inorganic compounds, and may 
be obtained from a number of sources, including pharmaceutical companies and specialty 
suppliers of libraries (e.g., combinatorial libraries) of compounds. 
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In cases where an identified active compound is a peptide, the peptide may be 
utilized to design a peptoid mimetic and aid in the discovery of orally-active small molecule 
mimetics. Alternatively, the peptides themselves may be used as therapeutics. 

Further, the structure of a bioactive polypq)tide may be determined using, for 
5 example, NMR, and may be used to select the types of small molecules screened. 

Methods of the present invention are well suited for screening libraries of 
compounds in multi-well plates (e.g., 96-well plates), with a different test compound in 
each well. In particular, the methods may be employed with combinatorial libraries. A 
variety of combinatorial libraries of random-sequence oligonucleotides, polypeptides, or 
10 synthetic oligomers have been proposed (Kramer, et aL, 1993; Houghten, 1985. 1994; 
Houghten, et aL, 1986. 1991. 1992; Ohlmayer, etaL, 1993; Dooley, et aL, 1993a.l993b; 
Eichler, etal., 1993; Pinilla, era/., 1992. 1993; Ecker, et at., 1993; and Barbas, et al., 
19SKJ). A number of small*molecule libraries have also been developed {e,g,, Bunin, et at., 
1994; Bunin and EUman, 1992; Virgilio and Ellman, 1994). 
IS Combinatorial libraries of oligomers may be formed by a variety of solution-phase 

or solid-phase methods in which mixtures of different subunits are added stepwise to 
growing oligomers or parent compound, until a desired oligomer size is reached (typically 
hexapeptide or heptapeptide). A library of increasing complexity can be formed in this 
manner, for example, by pooling multiple choices of reagents with each additional subunit 
20 step (Houghten, et at., 1991). 

Alternatively, the library may be formed by solid-phase synthetic methods in which 
beads containmg different-sequence oligomers that form the library are alternately mixed 
and separated, with one of a selected number of subunits being added to each group of 
separated beads at each step (Purka, et al., 1991; Lam, et aL, 1991, 1993; Zuckermann, et 
25 aL, 1992; Sebestyen, etal., 1993). 

The identity of library compounds with desired effects on the binding of a Wnt to a 
Wnt receptor can be determined by conventional means, such as iterative synthesis methods 
in which sublibraries containing known residues in one subunit position only are identified 
as containing active compoimds. 

30 

IX. Pharmaceutical Preparations of Active Compounds 

After identifying certain test compounds as potential WntR agonists or antagonists, 
the practitioner of the screening assay will typically continue to test the efficacy and 
specificity of the selected compounds both in vitro and in vivo. Whether for subsequent in 
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vivo testing, or for administration to an animal as an approved drug, agents identified in the 
screening assay can be formulated in pharmaceutical preparations for in vivo administration 
to an animal, preferably a human. 

The compounds selected in the screening assay, or a pharmaceutically acceptable 
5 salt thereof, may accordingly be formulated for administration with a biologically acceptable 
medium, such as water, buffered saline, polyol (for example, glycerol, propylene glycol, 
liquid polyethylene glycol and the like) or suitable mixtures thereof. The optimum 
concentration of the active ingredient(s) in the chosen medium can be determined 
empirically, according to procedures well known to medicinal chemists. As used herein, 

10 "biologically acceptable medium** includes any and all solvents, dispersion media, and the 
like which may be appropriate for the desired route of administration of the pharmaceutical 
preparation. The use of such media for pharmaceutically active substances is known in the 
art. Except insofar as any conventional media or agent is incompatible with the activity of 
the compound, its use in the pharmaceutical preparation of the invention is contemplated. 

15 Suitable vehicles and their formulation inclusive of other proteins are described, for 

example, in Gennaro, 1990. These vehicles include injectable "deposit formulations**. 
Based on the above, such pharmaceutical formulations include, although not exclusively, 
solutions or freeze-dried powders of the compound in association with one or more 
pharmaceutically acceptable vehicles or diluents, and contained in buffered media at a 

20 suitable pH and isosmotic with physiological fluids. In a preferred embodiment, the 
compound can be disposed in a sterile preparation for topical and/or systemic 
administration. In the case of freeze-dried preparations, supporting excipients such as, but 
not exclusively, mannitol or glycine may be used and appropriate buffered solutions of the 
desired volume will be provided so as to obtain adequate isotonic buffered solutions of the 

25 desired pH. Similar solutions may also be used for the pharmaceutical compositions in 
isotonic solutions of the desired volume and include, but not exclusively, the use of 
buffered saline solutions with phosphate or citrate at suitable concentrations so as to obtain 
at all times isotonic pharmaceutical preparations of the desired pH (for example, neutral 
pH). 

30 

The following examples illustrate but in no way are intended to limit the present 
invention. 
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Materials and Methods 
Unless otherwise indicated, restriction enzymes and DNA modifying enzymes were 
obtained from New England Biolabs (Beverly, MA) or Boehringer Mannheim (Indianapolis, 
IN). Nitrocellulose paper was obuined from Schleicher and Schuell (Keene, NH). Other 
5 chemicals were purchased from Sigma (St. Louis, MO) or United States Biochemical 
(Cleveland, OH). Unless otherwise specified, the experiments were performed using 
standard methods (Ausubel. etal., 1988; Sambrook. etal., 1989; Harlow, etal., 1988). 



A. Buffers 

10 

Phospfaate-buffered saline (PBS) 
lOx stock solution, 1 liter: 
80 g NaCI 
2gKCl 

15 11.5 g Na2HP04-7H20 

2 g KH2PO4 

Working solution, pH 7.3: 
137 mM NaCI 
2.7 mM KCI 
20 4.3 mM Na2HP04-7H,0 

1.4mMKH2P04 



EXAMPU 1 
Molecular Cloning of DFa2 
25 Polymerase chain reaction (PGR; Mullis, 1987; Mullis, et al., 1987) primer pools 

YW157 and YW158 were designed based on sequences (SEQ ID NO: 16. SEQ ID NO:17, 
respectively) conserved in Dfel, Human frizzled 3 (Hfz3). Rat frizzled 1 (Rfzl) and Rat 
frizzled 2 (Rfe2). The primer pools were completely degenerate, that is. each possible 
codon of each amino acid in SEQ ID NO: 16 and SEQ ID NO: 17 was represented in the 
30 respective primer pool, with the exception that the wobble base of the 3'-most codon was 
not included in YW157. The primers were used to amplily Drosophila genomic DNA, 
resulting in an amplification product that, when sequenced, was found to contain a novel 
frizxled family member - Dfz2. The PGR product was used to isolate genomic clones of 
Dfe2 from an adult Drosophila genomic library (Maniatis. et al.) and cDNA clones from a 
35 0-24 hr cDNA library. 

The amino acid sequence of Dfz2 was compared to that of Dfzl by aligning the 
sequences as shown in Fig. 1. Dfe2 and Dfzl are 32% identical. Idemical residues are 
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indicated in the consensus and the conserved cysteine residues in the cysteine-rich domain 
are in bold-face. The sequence alignments were done using the "GENEWORKS" program. 

Hydropathy values were calculated using the "MACVECTOR" 3.5 software 
according to the Kyte-Doolittle software and a window size of 15 amino acids. 

5 

Example 2 
In Situ RNA Hybridization 
In situ hybridization experiments were performed to determine the pattern of Dfz2 
expression. Freshly dissected adult brains, whole embryos or heads were rapidly frozen in 
10 plastic molds placed on a dry ice/alcohol slurry and processed for sectioning as described 
previously (Cole, et al., 1990). ^^S-Labeled antisense riboprobes were prepared from 
linearized p-BLUESCRIPT" plasmid subclones using either T3 or T7 RNA polymerase. In 
situ hybridization was performed as described by Saffen, et al , and hybridized sections 
were exposed to X-ray film and digitized. 

15 

Example 3 

Expression of DFz2 During Drosoohila Development 
The expression pattern of DFz2 was assessed using Northern (RNA) blot analysis. 
Total RNA was isolated using the LiCl-Urea precipitation method (Auffray and Rougeon, 

20 1980). 30 microgram of RNA from each sample was resolved on a formaldehyde 1 % 

agarose gel. The RNA was transferred to a nylon filter, cross-linked by UV irradiation and 
hybridized to a probe made by random priming Dfz2 or RP49 DNA fragments using 
standard methods (Sambrook, et aL, 1989). In other experiments, Poly (A)^ RNA from 
various stages of Drosophila development was first selected from total RNA using the 

25 Invitrogen "FASTTRACK" 2.0 kit and 5 ;xg was loaded per lane. 

Exemplary results are shown in Figure 3. A 4.0 kb transcript was detected in 
embryonic stages 0-2; 2-3; 4-5; 9-12, first, second and third instar larvae and pupae. A 
transcript of similar size was observed in Drosophila clone-8 cells (cl-8), a cell line from 
imaginal discs previously shown to be responsive to Wg activity in vitro, Drosophila 

30 Schneider 2 (S2) cells, which do not respond to Wg, did not contain detectable DFz2 
transcripts. The blot was also probed for expression of the ribosomal protein RP49 
(O'Connell and Rosbash« 1994, lower panel) as a control for RNA integrity and loading. 
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Example 4 

Transfection of DFz2 in S2 Cells Confers a Response to Wg protein 
S2 cells were evaluated for Dfz2 expression because the cells are known not to 
respond to Wg (Yanagawa, et al., 1995). Since, as described above, the native cells did 
5 not express D£z2, they were used in Dfz2 transfection experiments to determine whether 
expression of Dfz2 would confer sensitivity to Wg. 

An expression vector containing DFz2 coding sequences under the control of a 
metal-inducible metallothionein promoter was used to transfect S2 ceils using standard 
methods. Stable cell lines were derived by selection in hygromycin and tested for Dfz2 
10 expression. In cells grown in the absence of inducers, a baseline level of expression was 
detected with an antiserum to Dfz2. Induction of the metallothionein promoter resulted in 
increased levels of expression. 

Sensitivity of the Dfz2-transfected S2 cells to Wg protein was assessed by 
measuring the levels of armadillo (Arm) protein in response to Wg application. In intact 
IS Drosophila embryos and in clone-8 cells. Arm protein migrates in two different forms, 
differing from each other in phosphorylation. When these cells are incubated in the 
presence of soluble Wg protein, the level of the faster migrating (non-phosphorylated) form 
increases (Peifer, era/., 1994; Riggleman, e/a/., 1990; Van Leeuwen, etal., 1994). This 
increase can be detected using a standard Western blot assay as described below. 
20 Conditioned medium containing Wg protein was produced by subjecting S2HSwg 

cells to heat*shock for 30 minutes at 37**C, allowing the cells to recover for 30 minutes at 
25 •C, and resuspending them in S2 medium without fetal calf serum (FCS). The cells were 
incubated for 3 hrs to allow secretion of proteins into the medium, after which they were 
removed by centriftjgation (10 min., 2000 xg and Ihr, 100,000 xg, respectively). The 
25 conditioned media were concentrated 12-fold ("CENTRIPREPSO**, Amicon) and used in the 
experiments as follows. 

Clone 8, untransformed S2, and Dfz-transformed S2 (S2DJz2) cells were incubated 
for 2 hrs in 6-welI dishes in either normal concentrated medium or in concentrated medium 
from S2 cells producing Wg. 
30 Overexpression of the Dfz2 gene (under control of the metallothionein promoter) was 

induced by culturing S2Dfz2 and S2 control cells in S2 medium containing 0,5 mM CUSO4 
for 5 hrs prior to the incubation with the conditioned media. 

The target cells were lysed in lysis buffer (50 mM Tris, pH 7.5, 150 mM NaCI, 
1 % Nonidet-P40, 5 mM EDTA) supplemented with 20 fxg leupeptin. 100 /xg aprotinin and 
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180 fig PMSF per ml. The extracts were subjected to electrophoresis and Western blotting. 
Blots were stained in Ponceau Red to evaluate equal loading of total protein and transfer, 
and then incubated overnight in blocking buffer with monoclonal anti-arm antibody 7A1 at a 
1:1000 dilution or rat-polyclonal anti- a-catenin antibody DCAT-1 (Oda, et aL^ 1993), 
S diluted 1:1000. The blots were washed three times for 15 min each in TBST and incubated 
for 1 hr with horseradish peroxidase conjugated secondary antibodies (Biorad) diluted 
1:20,000 in blocking buffer. 

Incubation of DFz2-transfected S2 cells (but not untransfected S2 cells) in the 
presence of soluble Wg protein resulted in an increase in the level of Arm protein similar to 

10 that observed in Drosophila embryos and clone-8 cells. Exemplary results are shown in 
Fig. 4. Addition of Wg (wingless) results in increased signal intensity of the armadillo 
band. No such effect is observed with untransfected S2 cells. However, all four 
independent Dfz2-transfected S2 cell lines, derived from two separate transfections, showed 
increased armadillo signal in response to Wg (two of the four are shown). Further 

15 induction of Dfz2 expression by copper sulphate in the transfected cells led to a slight 

decrease in the response to Wg. As a control for equal loading, the blots were stripped and 
incubated with an antiserum against a-catenin (lower panel). 



Example 5 

20 Wg Protein Binds to Dfz2 Transfected Cells 

The results described in Example 4 showed that Dfz2 acts as a signal transducing 
molecule for Wg, suggesting that it is a receptor for Wg. Immunohistochemical analyses 
were performed to determine whether Wg was capable of binding to the Dfz2-transfected 
cells. 

25 Nontransfected Sneider 2 (S2) cells and S2 cells expressing Dfz2 were washed twice 

in PBS and incubated with 1.5 ml of medium alone or 1.5 ml of a lOx concentrated stock 
of Wg conditioned medium at 4**C for 3 hours. After three 10 minute washes with PBS, 
the cells were fixed in 2% methanol-free formaldehyde (Polysciences, Inc) for 15 minutes at 
room temperature. Following three more 10 minute washes with PBS, affinity purified Wg 

30 antibody at 1/25 and 5% donkey serum were added to the cells in PBS and incubated 
overnight at 4°C. 

The antiserum was affinity-purified using a bacterial fusion protein containing a 
domain unique to Wg (the Wg insert — an 85 amino acid sequence not found in any wg 
orthologs). Previous experiments have indicated that this domain is dispensable for Wg 
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activity, that it probably does not participate in the interactions between Wg and its 
recq)tor. 

Following 3 additional 10 minute washes, fluorescent-labeled cy3 secondary 
antibody, donkey anti-rabbit (Sigma), at 1/100 and 5% donkey serum were added to the 
S cells for 1 hour at room traiperature. The cells were then washed 3 more times in PBS and 
mounted in Vectashield mounting medium (Vector). 

Confocal images were collected with a Bio-Rad MRC 1000 confocal laser attached 
to a Zeiss Axio scope microscope. Exraiplary images are shown in Figs SA-SF. Normal 
and transfected cells were incubated with either normal S2 medium (Fig. SA) or 

10 concentrated conditioned medium from S2 cells producing Wg (Figs. SB. SC, SD, SE, SF). 
Dfz2-transf6Cted S2 cells stained brightly in approximately 80% of the cells when incubated 
with Wg and the antiserum (Figure 5D) whereas Dfz2-transfected cells in the absence of 
Wg protein (Fig. SA) as well as non transfected S2 cells (Fig. SB) showed only some spots 
of background staining. The positive staining was not uniform over the cell surface but 

IS punctate and may reflect clustering of receptor complexes. 

The ability of Wg to bind was also tested in heterologous cells (human 293 cells) 
transiently-transfected with Dfz2. In view of high background binding observed in initial 
experiments, the transiently-transfected 293 cells were preincubated with chlorate, which 
inhibits sulfation of proteins and glucosaminoglycans, and with heparatinase, to remove 

20 heparin-Iike molecules. This pre-treatment significantly lowered the backgrotmd binding 
(presumably due to Wg binding to extracellular matrix; Fig. SE). As shown in Fig. SF, 
about 10-20% of the transfected cells remained positive, similar to the transfection 
efficiency of 293 cells. Since 293 cells are of human origin, these results strongly suggest 
that Wg binds to Dfz2 itself, rather than to a molecule whose expression is induced by 

2S Dfz2. 

In contrast to the positive staining patterns observed with Dfz2-transfected cells, no 
staining was detected in S2 cells expressing Notch (Fig. 5C). Notch is a protein that has 
been previously proposed to act as a receptor for Wg (Couso and Arias, 1994). 

The above results taken together indicate that Wg protein can specifically bind to 
30 cells expressing Dfz2, and that this binding is not likely due to clonal variation. 
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Example 6 



Binding of Metabolicallv-Labeled Wg Protein to a Dfz-2/lgG Fusion Protein 
The binding of Wg protein to Dfz2 itself was also assayed using a fusion protein 

containing the cysteine-rich amino-terminal domain of Dfz2, linked to the constant dotnain 
5 of human IgG. The fusion protein or IgG control was added to conditioned medium from 

normal S2 cells, or S2 cells producing Wg (HS-wg/S2), which had been metabolically- 

labeled with pS] cysteine and methionine. 

The fusion proteins and possible complexes were then retrieved by adding 

sepharose-ProteinA beads and analyzed by gel electrophoresis and fluorography. Figure 6 
10 shows that the Dfzl fusion protein, but not the control IgG, selectively buids to labeled 

proteins of 52 kD, the size of the mature Wg protein. Normal S2 cells did not produce 

Dfz-2 binding proteins. 

While the invention has been described with reference to specific methods and 
IS embodiments, it is appreciated that various modifications and changes may be made without 
departing from the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: The Board of Trustees of the Leland Stanford 
Junior University, et al. 

(ii) TITLE OF INVENTION: Wnt Receptor Compositions and Methods 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Dehlinger & Associates 

(B) STREET: 350 Cambridge Avenue, Suite 250 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94306 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: ll-APR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/015,307 

(B) FILING DATE: 12-APR-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Sholtz, Charles K. 

(B) REGISTRATION NUMBER: 38,615 

(C) REFERENCE/DOCKET NUMBER: 8600-0167.41 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 324-0880 

(B) TELEFAX: (415) 324-0960 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: D£z2 Polynucleotide, coding region 
begins at nucleotide #225 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GCGCTGTGTC TGAAGGAAAC ACTACCCGCT TTTCCGGCTC TCGAGGCGCC TCCACGAAGG 60 
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AGTGAGGTGC 


AAC C G C AG AG 


AAGGTCAGCA 


AAGAAAGAGC 


AAGG GG TTP P 


A Af2TPapar»a 

AAVa X C ACAC A 


1 0 A 
u 




GCTAAGACGP 


APAAAATGAG 


APAPAATPGA 


PTGAAGGTPP 
V- X vai-VrkVaVa X CC 


XVaAXCC XGGva 






V» X V3k» X A X 


C X X \3 X CV3AI3C 


l3vaA X VaVarACCw 


PTYSP a p a nr»r» 

C X vaC ACAva X va 


C GG ATcACGG 


240 




n X V3V3W VMTt J. V3V3 


GTGGTPAPGG 

W X V3V3 X W#VV»V3U 


PPTGGAPGPG 

V» w A \3V3nV»W Vw%9 


AGTPPPfSPa P 
Ava X cccvaCAC 


CCGGX XAcGVa 


300 


AGTGCCAGCC 


ATACCCAAGG 


ATCCCAATCT 


GCGATGCGAG 


GAGATPAPPA 


T A PP A AT<'2TV2 
X AC v«AA X Va XAa 




TfV^nGGPATT 


w\3%w X AV«AnV.n 


*Pda P aTPP 


PPPPAAPGAA 


A X VaAACCA X Va 


a a A r^r^ A 
AG AC C C AGGA 


420 


AnV9k« VarVsVar V» 


V* X V3k3Al3V3 X OC 


appanTTPT^a 

ACIm^AVVX XCXV3 


GPPPPTGrSTfS 
\3C CCC X VaVa X l3 


vaAlaAX CAAAX 


GCTcGccGGA 


480 


CCTCAAGTTC 


TTCCTGT6CA 


a nv^'pa a 
VvCAxAjrXACAV.. 


vaCCcAXCXCaC 


CTGGAGGATT 


ACCACAAGCC 


54 0 




TGCCJGGAGTG 


m/ '*H*r*r^r* tv 71 ^ 
TCTCsUCaAlsACv 


ACaCCCCaC X CG 


GGATGCGCAC 


CCATCATGCA 


600 




X X CioAAXAvoC 


rrsT'ar'a^aa'P 

C IsoAl? AVa AA X 


\avaC\a X VaCVaAia 


CAC X X VaCCcC. 


^Pnt^ TV ••!/'•/ ■I<T%^^ TV 

X XCATGGTGA 


660 


U C V. CvxACAA J. 


L. X \3 X \3V. A X V3V9 


AAC AuCCw X C 


va X ACAC(jVaA\a 


VaC xvaGcAGcG 


GTGGcAG cTC 


720 


GGGCGG ATC G 


GGTGGCTCTG 


G cAGcGGTT c 


cGGcTccGGc 


^ ^ TV TV TV /~»/~«/^ TV 

GG c AAAcGG A 


AGCAAGGAGG 


780 


CAGTGGCTCG 


GGCGGCAGTG 


GGGCCGGCGG 


cAGcAGcGGT 


TCCACcTCAA 


CGAAGCCGTG 


840 


CCGCGGACGC 


AATTCAAAAA 


ACTGCCAAAA 


TV r^/~«Ts 

TccccAAGGA 


GAAAAGGCAA 


GCGGAAAAGA 


900 


GTGCAGCTGC 


TCGTGCCGCT 


CCCCACTCAT 


CTTC cTGGGG 


AAGGAGCACT 


GGCTGCAGCA 


960 


GCAGTCGCAG 


TV n>/^ ^ Jv TV 

ATGCC CATGA 


TGCAccATcC 


AC AC cAC X (jlj 


T A 0 TV nr«/^ TV TV r^K^ 
TAcATGAAcC 


TcAcTGTCCA 


1020 


AAGG ATCG CC 


GGCGTTCCAA 


ACTG CGGCAT 


TV /~»^/^ fT^r^ TV TV 

AccGTGcAAG 


GGGCCCTTCT 


Tl ^•^«Tk ^ ^T^^ Tk 

TCAGCAACGA 


1080 


CGAAAAGGAT 


TTCGCCGGCC 


TCTGGATCGC 


CCTGxGGTcG 


GGACTGTGcT 


TCTGCAGCAC 


1140 


GCTCATGACC 


/ Mil TV TV /^<^7V ^ TV 

CTAACCACAT 


X CATC A rCCaA 


cACCCaAAAvaVa 


■ ini HI* TV TV ^fr»TV r^c^ 

X X TAAGTAcc 


f^r^^ TV /*f ^^^^^ 

CGGAGcGGcC 


1200 


ATTGTCTTCC 


TCTCCGCCTG 


CTACTTCATG 


va X%a\aC ACa X vao 


GC X ACCTvaTc 


GcGcAACTTC 


1260 


/■irtt/^^ JV/^ TV 7S 


TVi*t^TV/*lTVtH#'y»/^ 

ACaGAvjA X v.oC 


C XAaC^aACvaVaC 


C X VjC X oC X CC 


VavaVaAAAVaC X C 


CACGGG X CCG 


1320 


lJACTuTTGL.A 


CCCTGGTCTT 


CCXuCXCACC 


X AC X X C X X X ia 


GCA X VaVaCC X C 


G X CCA X C X GG 


1380 


I V3V9I3 i l3 A X C. V. 


XfJACl X XLIAC 


C X vau X X CC X \3 


VaCCoC X vaKa X C 


X vaAAVa X VaVavava 


C AA X GAGGCC 


1440 


A I cACCAACaC 


TV /-irp/-«/-i T^ rp T\ 

AO. rCCsC AI3 X A 


CX ICcAXClia 


UrCCVaCC X ova X 


XVaAX XCCCAC 


X G X ccAG rcc 


1500 


1 V9V3L.L.i3 1 AL. 


X X VsL. X V. X U 


nnr*nnTi~*r* a t 

V9i9Cur\3 X V3\aA X 


nncn a tpp p a 

uVaCu A X CC CA 


X X C X VaVaVaC A X 


C X vaC X AX va X G 


1 C f 


V3\3V^/i/\^\_ 1 W A 


a •pr*i^/'2<^ aTP a 

AXCCuVaAX CA 


PPTaJVaOAPP 
CC X ArwlVaACC 


X X X va X vav^ x \a\a 


P^^P P^SPTPTf 
wv*^ v.vav» X v^ X X 


AVa X X X ACC X C 




V3 X AM X V*\i3\3 wM 


pp a rrTTT'PT 

V.CACC X X Cv. X 


GATGGPPGGP 


'I'TTGTGT'PPP 
XXX va X w X v»v.>v« 


TPTTPPGPAT 
X v> X xwwvav«Avx 


PPGPTPGO TT 
V- v«va w X v^vava x x 


1 ^ 0 n 




av3\.9\jCuo X 0 X 


a a nr*^czn*v 

AV.3V3AVJC X X 


GTP A AGGPGG 
V3 X v»^u^vavaVa.\av3 


AP A A r! PT<^f5 A 


fsaaapTnaT/^ 

VaAAAC X V3A X \jr 


X / ^ u 


a TO a r!f2 a TT<^ 


\3CA 1 CI 1 C X C 


OVJ 1 \,7C X W X r\\^ 


APRGTGPPGG 
^^v.V3V3 X vav^^^vava 


PPAPPATAGT 


X n X V. VjVJXn X 0 X 


X D u u 


1 AC CI V3 X AL. V7 


AAtjCAtjCC X A 


C X 1 X VaAljWaAC 


1 Vsox-V X V-MnVava 


pppTftinpPTr! 

wv^v. X VaVaCC X va 


1 CCAXCrCOCC 


T Q c n 
X 0 0 u 


CAGGTGAAGG 


GTCCCGGCAA 


GAAGCCTCTC 


TACTCGGTCC 


TGATGCTCAA 


GTACTTCATG 


1920 


GCCCTGGCCG 


TGGGCATCAC 


CTCGGGCGTG 


TGGATCTGGT 


CTGGCAAGAC 


GCTGGAGAGC 


1980 


TGGCGACGCT 


TCTGGCGGAG 


ACTCCTAGGA 


GCGCCGGACC 


GCACGGGCGC 


CAACCAGCTG 


2040 


GCGATCAAGC 


AGCGGCCTCC 


GATCCCGCAT 


CCCTATGCCG 


GATCTGGAAT 


GGGCATGCCC 


2100 
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GTGGGCTCGG CGGCGGGCTC CCTGCTGGCC ACGCCCTACA CCCAGGCGGG CGGACGTCGG 2160 

TGGCCTCCAC CAGCCACCAC CACCTGCACC ACCACGTTCT CAAGCAGCCG GCGGCCAGCC 2220 

ACGTATGACA TGGAGAGTCG GGGGGAGCAT CGACCATGGG CGGCGGTGGG GGCGGCGGTA 2280 

CAGCCCTTG6 CGGCGGCACC CTGGGCCACG GCACCGCGAT GAGCAGCAGC ACGGTCGGCA 2340 

TGGG 2344 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 amino acids 

(B) TYPE: amino acid 

(C) STRAIODEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

<iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Dfz2 Polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg His Asn Arg Leu Lys Val Leu lie Leu Gly Leu Val Leu Leu 
15 10 15 

Leu Thr Ser Cys Arg Ala Asp Gly Pro Leu His Ser Ala Asp His Gly 

20 25 30 

Met Gly Gly Met Gly Met Gly Gly His Gly Leu Asp Ala Ser Pro Ala 
35 40 45 

Pro Gly Tyr Gly Val Pro Ala lie Pro Lys Asp Pro Asn Leu Arg Cys 
50 55 60 

Glu Glu lie Thr lie Pro Met Cys Arg Gly lie Gly Tyr Asn Met Thr 
65 70 75 80 

Ser Phe Pro Asn Glu Met Asn His Glu Thr Gin Asp Glu Ala Gly Leu 
85 90 95 

Glu Val His Gin Phe Trp Pro Leu Val Glu lie Lys Cys Ser Pro Asp 
100 105 110 

Leu Lys Phe Phe Leu Cys Ser Met Tyr Thr Pro lie Cys Leu Glu Asp 
115 120 125 

Tyr His Lys Pro Leu Pro Val Cys Arg Ser Val Cys Glu Arg Ala Arg 
130 135 140 

Ser Gly Cys Ala Pro lie Met Gin Gin Tyr Ser Phe Glu Trp Pro Glu 
145 150 155 160 

Arg Met Ala Cys Glu His Leu Pro Leu His Gly Asp Pro Asp Asn Leu 
165 170 175 

Cys Met Glu Gin Pro Ser Tyr Thr Glu Ala Gly Ser Gly Gly Ser Ser 
ISO 185 190 

Gly Gly Ser Gly Gly Ser Gly Ser Gly Ser Gly Ser Gly Gly Lys Arg 
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195 



200 



205 



Lys Gin Gly Gly Ser Gly Ser Gly Gly Ser Gly Ala Gly Gly Ser Ser 
210 215 220 

Gly Ser Thr Ser Thr Lys Pro Cys Arg Gly Arg Asn Ser Lys Asn Cys 
225 230 235 240 

Gin Asn Pro Gin Gly Glu Lys Ala Ser Gly Lys Glu Cys Ser Cys Ser 
245 250 255 

Cys Arg Ser Pro Leu lie Phe Leu Gly Lys Glu Gin Leu Leu Gin Gin 
260 265 270 

Gin Ser Gin Met Pro Met Met His His Pro His His Trp Tyr Met Asn 
275 280 285 

Leu Thr Val Gin Arg He Ala Gly Val Pro Asn Cys Gly He Pro Cys 
290 295 300 

Lys Gly Pro Phe Phe Ser Asn Tlsp Glu Lys Asp Phe Ala Gly Leu Trp 
305 310 315 320 

He Ala Leu Trp Ser Gly Leu Cys Phe Cys Ser Thr Leu Met Thr Leu 
325 330 335 

Thr Thr Phe He He Asp Thr Glu Arg Phe Lys Xaa Pro Gly Ala Ala 
340 345 350 

He Val Phe Leu Ser Ala Cys Tyr Phe Met Val Ala Val Gly Tyr Leu 
355 360 365 

Ser Arg Asn Phe Leu Gin Asn Glu Glu He Ala Cys Asp Gly Leu Leu 
370 375 380 

Leu Arg Glu Ser Ser Thr Gly Pro His Ser Cys Thr Leu Val Phe Leu 
385 390 395 400 

Leu Thr Tyr Phe Phe Gly Met Ala Ser Ser He Trp Trp Val He Leu 
405 410 415 

Thr Phe Thr Trp Phe Leu Ala Ala Gly Leu Lys Trp Gly Asn Glu Ala 
420 425 430 

He Thr Lys His Ser Gin Tyr Phe His Leu Ala Ala Trp Leu He Pro 
435 440 445 

Thr Val Gin Ser Val Ala Val Leu Leu Leu Ser Ala Val Asp Gly Asp 
450 455 460 

Pro He Leu Gly He Cys Tyr Val Gly Asn Leu Asn Pro Asp His Leu 
465 470 475 480 

Lys Thr Phe Val Leu Ala Pro Leu Phe Val Tyr Leu Val He Gly Thr 
485 490 495 

Thr Phe Leu Met Ala Gly Phe Val Ser Leu Phe Arg He Arg Ser Val 
500 505 510 

He Lys Gin Gin Gly Gly Val Gly Ala Gly Val Lys Ala Asp Lys Leu 
515 520 525 

Glu Lys Leu Met He Arg He Gly He Phe Ser Val Leu Tyr Thr Val 
530 535 540 

Pro Ala Thr He Val He Gly Cys Tyr Leu Tyr Glu Ala Ala Tyr Phe 
545 550 555 560 
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Glu Asp Trp lie Lys Ala Leu Ala Cys Pro Cys Ala Gin Val Lys Gly 
565 570 575 

Pro Gly Lys Lys Pro Leu Tyr Ser Val Leu Met Leu Lys Tyr Phe Met 
580 585 590 

Ala Leu Ala Val Gly He Thr Ser Gly Val Trp He Trp Ser Gly Lys 
595 600 605 

Thr I*eu Glu Ser Trp Arg Arg Phe Trp Arg Arg Leu Leu Gly Ala Pro 
610 615 620 

Asp Arg Thr Gly Ala Asn Gin Ala Leu lie Lys Gin Arg Pro Pro He 
625 630 635 640 

Pro His Pro Tyr Ala Gly Ser Gly Met Gly Met Pro Val Gly Ser Ala 
645 650 655 

Ala Gly Ser Leu Leu Ala Thr Pro Tyr Thr Gin Ala Gly Gly Ala Ser 
660 665 670 

Val Ala Ser Thr Ser His His His Leu His His His Val Leu Lys Gin 
675 680 685 

Pro Ala Ala Ser His Val 
690 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mus musculus friz2led-3 protein. 
Coding Region: 313.. 2313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCGGCA CGAGAAGATG GAATCTGTGA TTT6GGAATG CGGTTGATGG AGTTGCTATG 60 

CTGGCCAGAT GTGCCCAATG TAATAAAATG AAAAGAAGAT ACAAGATGAT GTCATCTTCC 120 

CATATTGTGA AACCAAAAAC AAATGCCCTT TGTGAGACCA GGTTACCAGT TCTTTGACAG 180 

TACAGGGAGT TTTTAAACTG AGGAGCCTAA CAGATAAGGG GTACTTTCAA GCTGAGACCT 24 0 

GCAGGCATAT ACTGATCTAA AACGCATCTT GTGTAGATCT GATCATCCGA GCCTCATTCT 300 

GATCCAGGAA GAATGGCTGT GAGCTGGATT GTCTTTGATC TTTGGCTCTT GACTGTGTTT 360 

CTGGGGCAGA TAGGTGGGCA CAGTTTGTTT TCTTGTGAAC CTATAACCTT GAGGATGTGC 420 

CAAGATTTGC CTTACAATAC TACCTTCATG CCTAATCTTC TGAACCATTA TGACCAACAG 4 80 

ACTGCAGCTT TAGCAATGGA GCCCTTCCAC CCTATGGTGA ACCTGGATTG TTCTCGGGAT 54 0 

TTTCGGCCAT TTCTTTGTGC ACTCTATGCC CCTATTTGTA TGGAATATGG ACGTGTCACA 600 
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CTTCCCTGCC 


GTAGGCTGTG 


TCAGCGTGCC 


TATAGCGAGT 


GTTCAAAACT 


CATGGAGATG 


660 


TTTGGTGTCC 


CGTGGCCTGA 


AGATATGGAG 


TGCAGTAGGT 


TTCCAGATTG 


TGATGAGCCA 


720 


TATCCCCGAC 


TTGTGQATTT 


GAATTTAGTT 


GGAGATCCAA 


CTGAAGGAGC 


CCCAGTTGCA 


780 


QTGCAGAGGG 


ACTATGGTTT 


TTGGTGTCCC 


AGAGAGTTAA 


AAATTGATCC 


TGATCTTGGC 


840 


TATTCCTTTC 


TGCACGTGCG 


AGATTGTTCG 


CCACCATGTC 


CCAATATGTA 


CTTCAGGAGA 


900 


GAAGAACTGT 


CATTTGCTCG 


CTATTTCATA 


GGCCTGATTT 


CAATCATTTG 


CCTCTCTGCC 


960 


ACATTGTTTA 


CTTTTTTAAC 


CTTTCTAATT 


GACGTCACAA 


GATTCCGTTA 


CCCTGAAAGA 


1020 


CCTATCATAT 


TTTATGCAGT 


CTGCTACATG 


ATGGTGTCAT 


TAATTTTCTT 


CATTGGGTTT 


1080 


TTGCTGGAGG 


ACCGAGTAGC 


CTGCAATGCA 


TCTAGCCCTG 


CACAGTATAA 


GGCTTCTACA 


1140 


GTGACACAAG 


GATCTCACAA 


TAAGGCCTGT 


ACCATGCTCT 


TTATGGTACT 


ATATTTTTTC 


1200 


ACTATGGCTO 


GCAGTGTATG 


GTGGGTAATT 


CTTACCATCA 


CATGGTTTTT 


AGCAGCTGTG 


1260 


CCAAAGTGGG 


GCAGTGAAGC 


TATTGAGAAG 


AAAGCATTGC 


TGTTTCATGC 


CAGTGCCTGG 


1320 


GGCATCCCCG 


GAACTCTAAC 


TATCATCCTT 


TTAGCGATGA 


ATAAAATTGA 


AGGTGACAAT 


1380 


ATTAGTGGCG 


TGTGTTTTGT 


CGGCCTCTAC 


GACGTTGATG 


CATTAAGATA 


TTTCGTTCTC 


1440 


GCTCCCCTCT 


GCCTGTATGT 


GGTAGTTGGG 


GTTTCTCTCC 


TTTTAGCCGG 


CATTATATCC 


1500 


CTAAACAGAG 


TTCGGATTGA 


QATCCCATTA 


GAAAAGGAAA 


ACCAAGATAA 


GTTAGTGAAG 


1560 


TTCATGATCC 


GGATTGGTGT 


TTTCAGCATT 


CTCTACCTTG 


TGCCACTCTT 


GGTTGTAATT 


1620 


GGATGTTACT 


TTTATGAGCA 


AGCTTACCGC 


GGCATCTGGG 


AGACAACATG 


GATCCAGGAA 


1680 


CGCTGCAGAG 


AGTATCACAT 


TCCATGTCCG 


TACCAGGTTA 


CTCAGATGAG 


TCGTCCAGAC 


1740 


CTGATTCTCT 


TTCTGATGAA 


GTATCTCATG 


GCTCTCATAG 


TTGGGATTCC 


CTCTATATTT 


1800 


TGGGTTGGAA 


GCAAAAAGAC 


ATGCTTTGAA 


TGGGCCAGTT 


TTTTCCATGG 


GCGTAGGAAA 


1860 


AAAGAGATAG 


TQAATGAGAG 


CCGGCAGGTG 


CTCCAGGAAC 


CTGACTTTGC 


TCAGTCACTC 


1920 


CTGAGGGACC 


CAAATACTCC 


AATTATAAGA 


AAATCAAGAG 


GAACTTCCAC 


TCAAGGGACA 


1980 


TCCACACATG 


CTTCTTCAAC 


TCAGCTGGCC 


ATGGTGGATG 


ACCAAAGAAG 


CAAAGCAGGG 


2040 


AGTGTCCACA 


GCAAAGTGAG 


CAGCTACCAT 


GGCAGCCTCC 


ACAGGTCACG 


GGATGGCAGG 


2100 


TACACTCCCT 


GCAGTTACCG 


AGGAATGGAG 


GAGAGACTAC 


CTCACGGCAG 


CATGTCACGG 


2160 


CTGACGGATC 


ATTCCAGGCA 


CAGTAGTTCT 


CATCGGCTCA 


ACGAGCAGTC 


CCGACACAGC 


2220 


AGCATCCGAG 


ACCTCAGTAA 


CAACCCCATG 


ACTCACATTA 


CACATGGCAC 


CAGCATGAAC 


2280 


CGTGTTATTG 


AGGAGGATGG 


AACCAGTGCT 


TAGTCTTGTC 


TAAGGTGAAA 


TGTGTGCTGT 


2340 


TGAAAAGCAG 


GTTTTGCCTT 


CGCATGGCTG 


GCTGCTGTAA 


CTCACTGTCG 


CTCTGCTTTC 


2400 


TTGGGCAGAG 


TGTCAGCCTG 


GGAAAGTAGA 


TCTTTGCTCT 


TTGTATCACA 


TCAACCCTGG 


2460 


GGTGTGAACA 


CATCCAAACC 


CTAAGGATCA 


TGTCATCACA 


AAAGTAATTC 


TTTCTAGGCT 


2520 


GTGAAGAGAT 


GATTGTCTGG 


TGAGCATTTT 


TTATAAACAT 


GCTTATTTTA 


TATCTAGAAA 


2580 


AATCCTCTAT 


GTGTGGTGAC 


TGCTTTGTAG 


TGAATTTCAT 


ATAA 




2624 
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(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii> HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mfz3 protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ala Val Ser Trp lie Val Phe Asp Leu Trp Leu Leu Thr Val Phe 
15 10 15 

Leu Gly Gin lie Gly Gly His Ser Leu Phe Ser Cys Glu Pro lie Thr 
20 25 30 

Leu Arg Met Cys Gin Asp Leu Pro Tyr Asn Thr Thr Phe Met Pro Asn 
35 40 45 

Leu Leu Asn His Tyr Asp Gin Gin Thr Ala Ala Leu Ala Met Glu Pro 
50 55 60 

Phe His Pro Met Val Asn Leu Asp Cys Ser Arg Asp Phe Arg Pro Phe 
65 70 75 80 

Leu Cys Ala Leu Tyr Ala Pro lie Cys Met Glu Tyr Gly Arg Val Thr 
85 90 95 

Leu Pro Cys Arg Arg Leu Cys Gin Arg Ala Tyr Ser Glu Cys Ser Lys 
100 105 110 

Leu Met Glu Met Phe Gly Val Pro Trp Pro Glu Asp Met Glu Cys Ser 
115 120 125 

Arg Phe Pro Asp Cys Asp Glu Pro Tyr Pro Arg Leu Val Asp Leu Asn 
130 135 140 

Leu Val Gly Asp Pro Thr Glu Gly Ala Pro Val Ala Val Gin Arg Asp 
145 150 155 160 

Tyr Gly Phe Trp Cys Pro Arg Glu Leu Lys lie Asp Pro Asp Leu Gly 
165 170 175 

Tyr Ser Phe Leu His Val Arg Asp Cys Ser Pro Pro Cys Pro Asn Met 
180 185 190 

Tyr Phe Arg Arg Glu Glu Leu Ser Phe Ala Arg Tyr Phe lie Gly Leu 
195 200 205 

lie Ser lie lie Cys Leu Ser Ala Thr Leu Phe Thr Phe Leu Thr Phe 
210 215 220 

Leu lie Asp Val Thr Arg Phe Arg Tyr Pro Glu Arg Pro lie lie Phe 
225 230 235 240 

Tyr Ala Val Cys Tyr Met Met Val Ser Leu lie Phe Phe He Gly Phe 
245 250 255 
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Leu Leu Glu Asp Arg Val Ala Cys Asn Ala Ser Ser Pro Ala Gin Tyr 
260 265 270 

Lys Ala Ser Thr Val Thr Gin Gly Ser His Asn Lys Ala Cys Thr Met 
275 280 285 

Leu Phe Met Val Leu Tyr Phe Phe Thr Met Ala Gly Ser Val Trp Trp 
290 295 300 

Val lie Leu Thr lie Thr Trp Phe Leu Ala Ala Val Pro Lys Trp Gly 
305 310 315 320 

Ser Glu Ala lie Glu Lys Lys Ala Leu Leu Phe His Ala Ser Ala Trp 
325 330 335 

Gly He Pro Gly Thr Leu Thr He He Leu Leu Ala Met Asn Lys He 
340 345 350 

Glu Gly Asp Asn He Ser Gly Val Cys Phe Val Gly Leu Tyr Asp Val 
355 360 365 

Asp Ala Leu Arg Tyr Phe Val Leu Ala Pro Leu Cys Leu Tyr Val Val 
370 375 380 

Val Gly Val Ser Leu Leu Leu Ala Gly He He Ser Leu Asn Arg Val 
385 390 395 400 

Arg He Glu He Pro Leu Glu Lys Glu Asn Gin Asp Lys Leu Val Lys 
405 410 415 

Phe Met He Arg He Gly Val Phe Ser He Leu Tyr Leu val Pro Leu 
420 425 430 

Leu Val Val He Gly Cys Tyr Phe Tyr Glu Gin Ala Tyr Arg Gly He 
435 440 445 

Trp Glu Thr Thr Trp He Gin Glu Arg Cys Arg Glu Tyr His He Pro 
450 455 460 

Cys Pro Tyr Gin Val Thr Gin Met Ser Arg Pro Asp Leu He Leu Phe 
465 470 475 480 

Leu Met Lys Tyr Leu Met Ala Leu He Val Gly He Pro Ser He Phe 
485 490 495 

Trp Val Gly Ser Lys Lys Thr Cys Phe Glu Trp Ala Ser Phe Phe His 
500 505 510 

Gly Arg Arg Lys Lys Glu He Val Asn Glu Ser Arg Gin Val Leu Gin 
515 520 525 

Glu Pro Asp Phe Ala Gin Ser Leu Leu Arg Asp Pro Asn Thr Pro He 
530 535 540 

He Arg Lys Ser Arg Gly Thr Ser Thr Gin Gly Thr Ser Thr His Ala 
545 550 555 560 

Ser Ser Thr Gin Leu Ala Met Val Asp Asp Gin Arg Ser Lys Ala Gly 

565 570 575 

Ser Val His Ser Lys Val Ser Ser Tyr His Gly Ser Leu His Arg Ser 
580 585 590 

Arg Asp Gly Arg Tyr Thr Pro Cys Ser Tyr Arg Gly Met Glu Glu Arg 
595 600 605 

Leu Pro His Gly Ser Met Ser Arg Leu Thr Asp His Ser Arg His Ser 
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610 615 620 

Ser Ser His Arg Leu Asn Glu Gin Ser Arg His Ser Ser lie Arg Asp 
625 630 €35 640 

I^eu Ser Asn Asn Pro Met Thr His lie Thr His Gly Thr Ser Met Asn 
645 650 655 

Arg Val lie Glu Glu Asp Gly Thr Ser Ala Glx 
660 665 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1770 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Caenorhabditis elegans putative 
transmembrouie receptor (frizzled 1) gene. 
Coding region: 57.-1634 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GAATTCGGTT 


TAATTACCCA 


AGTTTGAGCT 


GTGAGCCCCC 


AATTCCATTA 


TCATTAATGG 


60 


GACCATTTCG 


TGGTTACCTC 


GGAGTAACCT 


GGCTCCTGTT 


GCTCTTTGTG 


ATTGGTGTGG 


120 


ACGGGCAGAG 


GTGTCAAAAG 


GTGGATCATG 


AGATGTGCAA 


CGATTTGCCG 


TATAACTTAA 


180 


CGAGCTTCCC 


AAATCTCGTC 


GACGAGGAAT 


CATGGAAAGA 


CGCCTCCGAA 


TCCATCCTCA 


240 


CCTACAAGCC 


CCTGCTCTCC 


GTTGTCTGCT 


CCGAGCAGCT 


CAAATTCTTC 


CTGTGCTCCG 


300 


TCTACTTCCC 


GATGTGCAAC 


GAGAAACTAG 


CCAACCCAAT 


TGGTCCATGC 


CGTCCATTGT 


360 


GTCTTTCCGT 


CCAGGAAAAG 


TGTCTTCCAG 


TGCTGGAAAG 


TTTCGGTTTC 


AAGTGGCCCG 


420 


ATGTGATTCG 


TTGTGATAAG 


TTCCCGTTGG 


AGAACAATCG 


AGAGAAAATG 


TGCATGAAAG 


480 


GGCCAAATGA 


GCAAGGAGCA 


ATTCAAGATG 


AGAGGGCAAA 


GTTTGCAGCG 


AAAGAAAGTG 


540 


AGGACGACGG 


TAATGATCGA 


GTAGAAGATA 


TTCAACGGGA 


GGTCGACCGC 


CTCAACGGAA 


600 


AATGCCCACA 


GGATGAGGTG 


TTCCTGAATC 


GATCCTCAAA 


GTGTGTGCCT 


TTGTGCTCGA 


660 


ACCCACAGAA 


GGTTGGGCAG 


ACTGACCGTG 


AATCCGCCAC 


CCGACTCTTG 


TTGTTTCTCT 


720 


CGCTGAGCTC 


TGTAATACTA 


ACAATTCTAT 


CAGTCTTCAT 


AGTCGGCTTA 


TCACGTCTCG 


780 


AGATGCTCCA 


CTCACTTACG 


GAAACTGCCA 


TGTTCTTCTC 


GTGCATCTCG 


TTTTGTGCGA 


840 


CATCGGTTAT 


TTATATTGTG 


AGCATTTCGT 


TTAAAGATCA 


GTTCCAAATC 


TCGTGCACCG 


900 


ACTACACCCA 


TCACCTGCTC 


TTCGTCGTCG 


GAGGGCTTTC 


CCATGTTCCA 


TGTTCTTCAG 


960 


TGGCCTCACT 


GATTTACTAC 


ACGGCAACTT 


GCTCACGTCT 


CTGGTGGCTC 


TTGATCTGTG 


1020 
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TGTCGTGGAA TAAGGCGACA AGGACATCGC ATATATTGGA CGACTCCAGA ACCCGCGTGA 1080 

TCATGCTCAT CCTGGGAATC CCGCTGGCTC CACTAATGCT CGCGCTACTC GCAAAAGCCG 1140 

TCGCCGCCAA TCCCCTCACC GGACTCTGCT TCATCGGAGC AGCAAGCCCG GGCACCGACT 1200 

GGATCTTCAA CTTCTGCCGG GAGCTCATTC TATTCCTCAT CAGCTCCATT GCTCTTTCGT 1260 

CTGCTTGCTG CCGGCTTCTG GGCTCTGATG AGCAGGATGT CAATGGGTTT GCCGGAGTCA 1320 

TTGCGGCAGT CTATCCGATT GCTGGACTAT TCTACATGCT TTCATTTGTG AACGATGCCA 1380 

CCCAACCGTT TCTCTCACTT GACAGAAGTT TCAATGCGGT CTCGGCGACC AAGTTCTCGT 144 0 

TTGATCTACT TTTGAGCTTC ATCATGTGCG CGTTTTGTCT TATTTACTTG CTGTTCAAGC 1500 

TGACTAGATC CTCATCAAAA GTTAGCAAAG AAGGATATCA ACCGGCGGTG CCGAAACTCC 1560 

CGCAACCGGC AATTCCCGGC AGTGTACGTT CGAACACCTA CGCGTCGACG TTTCGAACTA 1620 

ATAATATGAT TTGAAGGATT TTCAATAATT TTTTGTGAAA AACAACGGGT TTATATAGAT 1680 

AGAAAACAAA AAGGTGGTCT CAATTTTTTT TCCGTGAAAA TAAATTTTTA TTGATTTTTA 1740 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1770 
<2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Cfzl protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Pro Phe Arg Gly Tyr Leu Gly Val Thr Trp Leu Leu Leu Leu 
15 10 15 

Phe Val lie Gly Val Asp Gly Gin Arg Cys Gin Lys Val Asp His Glu 
20 25 30 

Met Cys Asn Asp Leu Pro Tyr Asn Leu Thr Ser Phe Pro Asn Leu Val 
35 40 45 

Asp Glu Glu Ser Trp Lys Asp Ala Ser Glu Ser lie Leu Thr Tyr Lys 
50 55 60 

Pro Leu Leu Ser Val Val Cys Ser Glu Gin Leu Lys Phe Phe Leu Cys 
65 70 75 80 

Ser Val Tyr Phe Pro Met Cys Asn Glu Lys Leu Ala Asn Pro lie Gly 
85 90 95 

Pro Cys Arg Pro Leu Cys Leu Ser Val Gin Glu Lys Cys Leu Pro Val 
100 105 110 
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Leu Glu Ser Phe Gly Phe Lys Trp Pro Asp Val He Arg Cys Asp Lys 
lis 120 125 

Phe Pro Leu Glu Asn Asn Arg Glu Lys Met Cys Met Lys Gly Pro Asn 
130 135 140 

Glu Gin Gly Ala He Gin Asp Glu Arg Ala Lys Phe Ala Ala Lys Glu 
145 150 155 160 

Ser Glu Asp Asp Gly Asn Asp Arg Val Glu Asp He Gin Arg Glu Val 
165 170 175 

Asp Arg Leu Asn Gly Lys Cys Pro Gin Asp Glu Val Phe Leu Asn Arg 
180 185 190 

Ser Ser Lys Cys Val Pro Leu Cys Ser Asn Pro Gin Lys Val Gly Gin 
195 200 205 

Thr Asp Arg Glu Ser Ala Thr Arg Leu Leu Leu Phe Leu Ser Leu Ser 
210 215 220 

Ser Val lie Leu Thr He Leu Ser Val Phe He Val Gly Leu Ser Arg 
225 230 235 240 

Leu Glu Met Leu His Ser Leu Thr Glu Thr Ala Met Phe Phe Ser Cys 
245 250 255 

He Ser Phe Cys Ala Thr Ser Val He Tyr He Val Ser He Ser Phe 
260 265 270 

Lys Asp Gin Phe Gin He Ser Cys Thr Asp Tyr Thr His His Leu Leu 
275 280 285 

Phe Val Val Gly Gly Leu Ser His Val Pro Cys Ser Ser Val Ala Ser 
290 295 300 

Leu He Tyr Tyr Thr Ala Thr Cys Ser Arg Leu Trp Trp Leu Leu He 
305 310 315 320 

Cys Val Ser Trp Asn Lys Ala Thr Arg Thr Ser His He Leu Asp Asp 
325 330 335 

Ser Arg Thr Arg Val He Met Leu He Leu Gly He Pro Leu Ala Pro 
340 345 350 

Leu Met Leu Ala Leu Leu Ala Lys Ala Val Ala Ala Asn Pro Leu Thr 
355 360 365 

Gly Leu Cys Phe He Gly Ala Ala Ser Pro Gly Thr Asp Trp He Phe 
370 375 380 

Asn Phe Cys Arg Glu Leu He Leu Phe Leu He Ser Ser He Ala Leu 
385 390 395 400 

Ser Ser Ala Cys Cys Arg Leu Leu Gly Ser Asp Glu Gin Asp Val Asn 
405 410 415 

Gly Phe Ala Gly Val He Ala Ala Val Tyr Pro He Ala Gly Leu Phe 
420 425 430 

Tyr Met Leu Ser Phe Val Asn Asp Ala Thr Gin Pro Phe Leu Ser Leu 
435 440 445 

Asp Arg Ser Phe Asn Ala Val Ser Ala Thr Lys Phe Ser Phe Asp Leu 
450 455 460 

Leu Leu Ser Phe He Met Cys Ala Phe Cys Leu He Tyr Leu Leu Phe 
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465 470 475 480 

l»ys Leu Thr Arg Ser Ser Ser Lys Val Ser Lys Glu Gly Tyr Gin Pro 
485 490 495 

Ala Val Pro Lys Leu Pro Gin Pro Ala lie Pro Gly Ser Val Arg Ser 
500 505 510 

Asn Thr Tyr Ala Ser Thr Phe Arg Thr Asn Asn Met lie Glx 
515 520 525 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mus musculus putative transmembrane 
receptor (frizzled 4) mRNA, 
Coding region: 238.. 1941 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TCGACCTCAA 


CACAAAGACC 


TGGGTCGTGA 


GACACACGCG 


TAGAGTCAGG 


CGGCTTCCCC 


60 


GAAAACCGGA 


CTCGGCCGGC 


GCCGAGTCTG 


GGTCCCCGCC 


TTCAACCATG 


ACCCTAGCAA 


120 


TCCATCCCTC 


GGCCCGGGCT 


CCGGACGTCT 


GATATTCCGC 


ACATTCTCGT 


ACAACTGCTG 


180 


GAGAGGCGAC 


TGCTGCCCCC 


TTGTCGCCCT 


TGGCGCCTTA 


CCGCATTCCC 


TATCCGGAGT 


240 


TGGGAGCAGC 


GCGGCCACCG 


GCGCCCCTGT 


GCAAACTGGG 


GGTGTCTGCT 


AGATCAGCCT 


300 


CTGCCGCTGC 


TGCCCGCAGC 


TCTGGCCATG 


GCCTGGCCGG 


GCACAGGGCC 


GAGCAGCCGG 


360 


GGGGCGCCTG 


GAGGCGTCGG 


GCTCAGGCTG 


GGGCTGCTGC 


TGCAGTTCCT 


CCTGCTCCTG 


420 


CGGCCGACAC 


TGGGGTTCGG 


GGACGAGGAG 


GAGCGGCGCT 


GCGACCCCAT 


CCGCATCGCC 


480 


ATGTGCCAGA 


ACCTCGGCTA 


CAACGTGACC 


AAGATGCCCA 


ACTTAGTGGG 


ACACGAGCTG 


540 


CAGACAGACG 


CCGAGCTGCA 


GCTGACAACT 


TTCACGCCGC 


TCATCCAGTA 


CGGCTGCTCC 


600 


AGCCAGCTGC 


AGTTCTTCCT 


TTGTTCGGTT 


TATGTGCCAA 


TGTGCACAGA 


GAAGATCAAC 


660 


ATCCCCATCG 


GCCCGTGCGG 


TGGCATGTGC 


CTTTCAGTCA 


AGAGACGCTG 


TGAACCAGTC 


720 


CTGAGAGAAT 


TTGGGTTTGC 


CTGGCCCGAC 


ACCCTGAACT 


GCAGCAAGTT 


CCCGCCCCAG 


780 


AACGACCACA 


ACCACATGTG 


CATGGAAGGA 


CCAGGTGATG 


AAGAGGTTCC 


CTTGCCCCAC 


840 


AAGACTCCCA 


TCCAGCCCGG 


GGAAGAGTGC 


CACTCCGTGG 


GAAGCAATTC 


TGATCAGTAC 


900 


ATCTGGGTGA 


AGAGGAGCCT 


GAACTGTGTT 


CTCAAGTGTG 


GCTACGATGC 


TGGCTTGTAC 


960 


AGCCGCTCAG 


CTAAGGAGTT 


CACGGATATT 


TGGATGGCTG 


TGTGGGCCAG 


CCTCTGCTTC 


1020 
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UW A \3^^WW A A V- 


CTGATTGATT 


CATCCAGGTT 


A AW A A#%WV-wA 


1 OftO 
X w 0 V 




X wAXAX X Xv« X 


\„Jv>a X AXoX VaV. 


TUTU a^paT'^n^p 

XAXAMXMX X X 


ATAGnaTTGr* 


TTa T a ' 1 J!' 1 ** r ' 

X XAXAX XvaX X 


X J.4 U 




♦piir!f2orv2ftn A 

X nUU w> vVivsuin 


a IVGf2 AT ATfr* 


XwXwAA A A AV7 


AAGAGGCGGf 


aG aGorr'GTT 






nnVSKSnwA X#%n 


CAAf?Af?AGGA 


TG'raCAATAA 


A A A A W A A A 


G anyj'i* a ^ttt 

V3tn A\a X Aw XXX 




X X Xww#W%X\W 


WW#%UV« A A 


A A WW A www A A 


ATTfT^GAOAf^ 


■pf AfTTGG' IT 

A V«i^%\» A A ^7w A A 


TTTGG^^ aG/^^ 

A A AV9wV»M,wWw 


1 0 n 


Vsl^n^ X ^.nAV> X 


urvndu X X \3tA 


]lGPr*A7*n[YSAA 
MVsImV.AX xuimm 


*ii^a^**i**i* 

t\ A \9 VwnwnO A X 


^ X X A X X X wV. A 


oaTfv^f^a^^^ 

UAX VA9IJAGV..C 


UoQ 


ll7V9i3U>XAX X 


^^v3U.>\Ls XoAA 


a unroll TTYSTr* 

iUlW>W»AX XwXVm 


AXV» X XA3AX X A 


TG aG ar^r a r**!* 

X li3A\9 A W X AV3 X 


GGAX GCwGAT 


1440 




GCTTGTGCTA 


X.GXXGGGAAU 


^a a a arr'Tar* 
wAAAAL.^ X AG 


TV m/^ / vip^ TV 

A^ Gv«\.CTCAv 


TGGCTTTGTG 


1500 


GTG6C7CC7C 


TCTTTACGTA 


X X X vaG XuAX X 


GGAAVcisC X G X 


XV.AX XGCGGk^ 


GGGTTTGGTG 


1560 


G^L.X XAI x\^J% 


TV 3111 TTf^fysTr* 

AAAX XCGGX^ 


^a a*TV"i"P/ 'a & 

WAilXV.X XWaA 


a a aGa f^/srsnia 


r'a a a/^a^ar*a 
V.AAAGACAGA 


^TV TV TV TV 

wAAGTTGGAA 


1620 


AGuCrxTAAT^vo 


X L.AAGAX vJGG 


GGTCTTCTCA 


G X Av» XvaXAv. A 


CGGTTCCTGC 


aacctgtgtg 


1680 


ATT6CCTGTT 


ATTTCTATGA 


T\ TV m^m^ TV TV TV ^ 

AATCTCAAAv. 


TGGGCACTCT 


TTCGATATTC 


«m »^Tv TV fnt^ TV ^*« 

TGCAGATGAC 


1740 


tcaaacatgg 


^^«v / nmm < TV Tk TV «t« 

CAGTTGAAAT 


^ tnwm < TV TV TV TV ffi*fn 

GTTGaAAATT 


TTTATGTCTT 


TGCTCGTGGG 


CATCACTTCA 


1800 


GG CATGTGG A 


TTTGGTCTGC 


CAAAACTCTT 


CACACGTGGC 


TV TV TV TV ;^**»TS^^ WfVIH<^ 

AAAAGTGTTC 


TAACCGATTG 


1860 


^^^^ 1^ ^^n*^^ ^^^^^^ 
G xGAAX 1 u XG 


GGAAGGTAAA 


GAGAGAGAAG 


Av^vsItGGAA i t» 


G I X GGGTGAA 


^^^TV/*^/^TV TV TV TV 

G CCAGGAAAA 


1920 


GGCAACGAGA 


C TG TGGT ATA 


TV TV / Tv ^ ^/^Z** 

AG ACTAG CCG 


GCTT CCTCGT 


TCCTCATTGT 


GAAGGAAGTG 


1980 


A xG AGGG AA 


X L. X C AG xT 1 G 


TV TV ^ TV TV Tl <*»>|WT» tv 


TV TV TV TV r**'!*' I 'r^ 

GAAALIAL>Tx 


TV f*r^r^ A /^TV TV 

AGC- CC ACACA 


CACCCACGTC 


2040 


Tl ^^^^ TV f^f~y TV y*v 

AGCCCACCAC 


CACTCACCCA 


TV ^t*t*^TV ^ TV rn/^ 

ACTCAGCATC 


TV TV TV TV /^r^TV TV 

AGAAGACCAA 


TGGCTTCACT 


GCAGACTTTG 


2100 


TV TV TV 

GAATGGTC C A 


TV TV TV rr^/^ TV TV TV TV 

AAATGGAAAA 


f^TS /^P 1 Vf 1 1 TV TV 

GCCAG X X AAG 


AGGTTTTCAA 


AGCTGTGAAA 


AATCAAAATG 


2160 


TTGATCACTX 


1 AGCAGGTCA 


CJAGv. X X GGAG 


X L.(.G X GGAGG 


1 CCCGCCTAG 


TV nWK.^^/ tflW * TV TV 

ATTCCTGAAG 


2220 


l*v»\JAGGG X GA 


X AG X o XXX GC 


X v.k. X AC X V9GG 


XGGGAX X XCA 


jpV ^^^^^^^^^^ R 

Av» X G X G AGTT 


TV IT TV TV TV tiM • ^ 

G AT AACATG C 


2280 


AA\^\vA.\3AAALv 


AX X AAX X X X X 


AAAAwCw XXX 


X AAAX X X XAA 


TV •pTV /-»nr»'n tv /"wpTV 
AXAGXAAL.XA 


AGGTCTTGCA 


2340 


GA X AG VAAAG 


XGAXuX AX AA 


AvJAlii* X GGAAA 


XGv-XGGGX XG 


GGAGACG X GT 


rpr^/^ TV TV J'liliiUlli 

TGCAGAGX X T 


2400 


TTATATGTTT 


CTGGXCTAAC 


TVmfV TV TV /^TV 111/ 

Ax AAAwiXv. X 


X C\iA3GUL. X AL. 


ACTGTCTGCT 


/■^Mim^nTv TV TV 

GTTTAGAACT 


2460 






TGGTfJTP A A A 
X vna X u X L^nMA 


A X CC X X WAw X 


wwV« X X l9 X X 


a a A TV r^7v/!!ii iv^ 
AAAAC«AGAAx 


2520 






f!TGTAr*TAA<** 


ACACGTAAGG 


TATPr*aGTGG 


AX X XCXWX\»iX 


0 c 0 rv 


CCTGAAATTT 




A A' 1 " 1 *f*T AfS^r* 

#M%A A W A^%wwW 


AGr*CCPTGTT 

^%V9W^WV>AwX A 


TTr^TTPa PTT 
X X W X X WA^ X X 


Ta a ar'Ta aT/** 

X AAAV. X AAXG 






AAAAAGGTTA 

A A#^ 


'rT'I'*l'*l*A'PA GG 

A A A A A#^A#%MW 


^%A AAAAAAAA 


Gr* AnTGr* a Gr* 

V9V»X%i\v X VJW/\0^ 


aTfirr'Ta aT^ 

AXGvU. X AAXG 




AGAGGAAAAG 


6A6GTGATCA 


CTTCTGACAA 


TCACTTAATT 


CAGAGAAAAA 


TGAGATTTGC 


2760 


TAATTGACTT 


ACCTTCCGAC 


CCCTAGAGAC 


CCTATTGCAT 


TAAGCAATGT 


TTAAGCAATT 


2820 


GGGGACTT 












2828 


(2) INFORMATION FOR SEQ ID NO; 8; 











{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: M£z4 protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Trp Pro Gly Thr Gly Pro Ser Ser Arg Gly Ala Pro Gly Gly 
15 10 15 

Val Gly Leu Arg Leu Gly Leu Leu Leu Gin Phe Leu Leu Leu Leu Arg 
20 25 30 

Pro Thr Leu Gly Phe Gly Asp Glu Glu Glu Arg Arg Cys Asp Pro lie 
35 40 45 

Arg lie Ala Met Cys Gin Asn Leu Gly Tyr Asn Val Thr Lys Met Pro 
50 55 60 

Asn Leu Val Gly His Glu Leu Gin Thr Asp Ala Glu Leu Gin Leu Thr 
65 70 75 80 

Thr Phe Thr Pro Leu lie Gin Tyr Gly Cys Ser Ser Gin Leu Gin Phe 
85 90 95 

Phe Leu Cys Ser Val Tyr Val Pro Met Cys Thr Glu Lys He Asn He 
100 105 110 

Pro He Gly Pro Cys Gly Gly Met Cys Leu Ser Val Lys Arg Arg Cys 

115 120 125 

Glu Pro Val Leu Arg Glu Phe Gly Phe Ala Trp Pro Asp Thr Leu Asn 
130 135 140 

Cys Ser Lys Phe Pro Pro Gin Asn Asp His Asn His Met Cys Met Glu 
145 150 155 160 

Gly Pro Gly Asp Glu Glu Val Pro Leu Pro His Lys Thr Pro He Gin 
165 170 175 

Pro Gly Glu Glu Cys His Ser Val Gly Ser Asn Ser Asp Gin Tyr He 
180 185 190 

Trp Val Lys Arg Ser Leu Asn Cys Val Leu Lys Cys Gly Tyr Asp Ala 

195 200 205 

Gly Leu Tyr Ser Arg Ser Ala Lys Glu Phe Thr Asp He Trp Met Ala 
210 215 220 

Val Trp Ala Ser Leu Cys Phe He Ser Thr Thr Phe Thr Val Leu Thr 
225 230 235 240 

Phe Leu He Asp Ser Ser Arg Phe Ser Tyr Pro Glu Arg Pro He He 
245 250 255 

Phe Leu Ser Met Cys Tyr Asn He Tyr Ser He Ala Tyr He Val Arg 
260 265 270 

Leu Thr Val Gly Arg Glu Arg He Ser Cys Asp Phe Glu Glu Ala Ala 
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275 280 285 

Glu Pro Val Leu He Gin Glu Gly Leu Lys Asn Thr Gly Cys Ala He 
290 295 300 

He Phe Leu Leu Met Tyr Phe Phe Gly Met Ala Ser Ser He Trp Trp 
305 310 315 320 

Val He Leu Thr Leu Thr Trp Phe Leu Ala Ala Gly Leu Lys Trp Gly 
325 330 335 

His Glu Ala He Glu Met His Ser Ser Tyr Phe His He Ala Ala Trp 
340 345 350 

Ala He Pro Ala Val Lys Thr He Val He Leu He Met Arg Leu Val 
355 360 365 

Asp Ala Asp Glu Leu Thr Gly Leu Cys Tyr Val Gly Asn Gin Asn Leu 
370 375 380 

Asp Ala Leu Thr Gly Phe Val Val Ala Pro Leu Phe Thr Tyr Leu Val 
385 390 395 400 

He Gly Thr Leu Phe He Ala Ala Gly Leu Val Ala Leu Phe Lys He 
405 410 415 

Arg Ser Asn Leu Gin Lys Asp Gly Thr Lys Thr Asp Lys Leu Glu Arg 
420 425 430 

Leu Met Val Lys He Gly Val Phe Ser Val Leu Tyr Thr Val Pro Ala 
435 440 445 

Thr Cys Val He Ala Cys Tyr Phe Tyr Glu He Ser Asn Trp Ala Leu 
450 455 460 

Phe Arg Tyr Ser Ala Asp Asp Ser Asn Met Ala Val Glu Met Leu Lys 
465 470 475 480 

He Phe Met Ser Leu Leu Val Gly He Thr Ser Gly Met Trp He Trp 
485 490 495 

Ser Ala Lys Thr Leu His Thr Trp Gin Lys Cys Ser Asn Arg Leu Val 
500 505 510 

Asn Ser Gly Lys Val Lys Arg Glu Lys Arg Gly Asn Gly Trp Val Lys 
515 520 525 

Pro Gly Lys Gly Asn Glu Thr Val Val Glx 
530 535 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Human transmembrane receptor 

(frizzled 5) mRNA, Coding region: 321.. 2078 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



ACCCAG6GAC 


GGAGGACCCA 


GGCTGGCTTG 


GGGACTGTCT 


GCTCTTCTCG 


GCGGGAGCCG 


60 


TGGAGAGTCC 


TTTCCCTGGA 


ATCCGAGCCC 


TAACCGTCTC 


TCCCCAGCCC 


TATCCGGCGA 


120 


GGAGCGGAGC 


GCTGCCAGCG 


GAGGCAGCGC 


CTTCCCGAAG 


CAGTTTATCT 


TTGGACGGTT 


180 


TTCTTTAAAG 


GAAAAACGAA 


CCAACAGGTT 


GCCAGCCCCG 


GCGCCACACA 


CGAGACGCCG 


240 


GAGGGAGAAG 


CCCCGGCCCG 


GATTCCTCTG 


CCTGTGTGCG 


TCCCTCGCGG 


GCTGCTGGAG 


300 


GCGAGGGGAG 


GGAGGGGGCG 


ATGGCTCGGC 


CTGACCCATC 


CGCGCCGCCC 


TCGCTGTTGC 


360 


TGCTGCTCCT 


GGCGCAGCTG 


GTGGGCCGGG 


CGGCCGCCGC 


GTCCAAGGCC 


CCGGTGTGCC 


420 


AGGAAATCAC 


GGTGCCCATG 


TGCCGCGGCA 


TCGGCTACAA 


CCTGACGCAC 


ATGCCCAACC 


480 


AGTTCAACCA 


CGACACGCAG 


GACGAGGCGG 


GCCTGGAGGT 


GCACCAGTTC 


TGGCCGCTGG 


540 


TGGAGATCCA 


ATGCTCGCCG 


GACCTGCGCT 


TCTTCCTATG 


CACTATGTAC 


ACGCCCATCT 


600 


GTCTGCCCGA 


CTACCACAAG 


CCGCTGCCGC 


CCTGCCGCTC 


GGTGTGCGAG 


CGCGCCAAGG 


660 


CCGGCTGCTC 


GCCGCTGATG 


CGCCAGTACG 


GCTTCGCCTG 


GCCCGAGCGC 


ATGAGCTGCG 


720 


ACCGCCTCCC 


GGTGCTGGGC 


CGCGACGCCG 


AGGTCCTCTG 


CATGGATTAC 


AACCGCAGCG 


780 


AGGCCACCAC 


GGCGCCCCCC 


AGGCCTTTCC 


CAGCCAAGCC 


CACCCTTCCA 


GGCCCGCCAG 


840 


GGGCGCCGGC 


CTCGGGGGGC 


GAATGCCCCG 


CTGGGGGCCC 


GTTCGTGTG C 


AAGTGTCGCG 


900 


AGCCCTTCGT 


GCCCATTCTG 


AAGGAGTCAC 


ACCCGCTCTA 


CAACAAGGTG 


CGGACGGGCC 


960 


AGGTGCCCAA 


CTGCGCGGTA 


CCCTGCTACC 


AGCCGTCCTT 


CAGTGCCGAC 


GAGCGCACGT 


1020 


TCGCCACCTT 


CTGGATAGGC 


CTGTGGTCGG 


TGCTGTGCTT 


CATCTCCACG 


TCCACCACAG 


1080 


TGGCCACCTT 


CCTCATCGAC 


ATGGACACGT 


TCCGCTATCC 


TGAGCGCCCC 


ATCATCTTCC 


1140 


TGTCAGCCTG 


CTACCTGTGC 


GTGTCGCTGG 


GCTTCCTGGT 


GCGTCTGGTC 


GTGGGCCATG 


1200 


CCAGCGTGGC 


CTGCAGCCGC 


GAGCACAACC 


ACATCCACTA 


CGAGACCACG 


GGCCCTGCAC 


1260 


TGTGCACCAT 


CGTCTTCCTC 


CTGGTCTACT 


TCTTCGGCAT 


GGCCAGCTCC 


ATCTGGTGGG 


1320 


TCATCCTGTC 


GCTCACCTGG 


TTCCTGGCCG 


CCGCGATGAA 


GTGGGGCAAC 


GAGGCCATCG 


1380 


CGGGCTACGG 


CCAGTACTTC 


CACCTGGCTG 


CGTGGCTCAT 


CCCCAGCGTC 


AAGTCCATCA 


1440 


CGGCACTGGC 


GCTGAGCTCC 


GTGGACGGGG 


ACCCAGTGGC 


CGGCATCTGC 


TACGTGGGCA 


1500 


ACCAGAACCT 


GAACTCGCTG 


CGGCGCTTCG 


TGCTGGGCCC 


GCTGGTGCTC 


TACCTGCTGG 


1560 


TGGGCACGCT 


CTTCCTGCTG 


GCGGGCTTCG 


TGTCGCTCTT 


CCGCATCCGC 


AGCGTCATCA 


1620 


AGCAGGGCGG 


CACCAAGACG 


GACAAGCTGG 


AGAAGCTCAT 


GATCCGCATC 


GGCATCTTCA 


1680 


CGCTGCTCTA 


CACGGTCCCC 


GCCAGCATTG 


TGGTGGCCTG 


CTACCTGTAC 


GAGCAGCACT 


1740 


ACCGCGAGAG 


CTGGGAGGCG 


GCGCTCACCT 


GCGCCTGCCC 


GGGCCACGAC 


ACCGGCCAGC 


1800 


CGCGCGCCAA 


GCCCGAGTAC 


TGGGTGCTCA 


TGCTCAAGTA 


CTTCATGTGC 


CTGGTGGTGG 


1860 


GCATCACGTC 


GGGCGTCTGG 


ATCTGGTCGG 


GCAAGACGGT 


GGAGTCGTGG 


CGGCGTTTCA 


1920 
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CCAGCCGCTG CTGCTGCCGC CCGCGGCGCG GCCACAAGAG CGGGGGCGCC ATGGCCGCAG 1980 

GGGACTACCC CGAGGCGAGC GCCGCGCTCA CAGGCAGGAC CGGGCCGCCG GGCCCCGCCG 2040 

CCACCTACCA CAAGCAGGTG TCCCTGTCGC ACGTGTAGGA GGCTGCCGCC GAGGGACTCG 2100 

GCCGGAGA6C TGAGGGGAGG GGGGCGTTTT GTTTGGTAGT TTTGCCAAGG TCACTTCCGT 2160 

TTACCTTCAT GGTGCTGTTG CCCCCTCCCG CGGC6ACTTG GAGAGAGGGA AGAGGG6CGT 2220 

TTTCGAGGAA GAACCTGTCC CAGGTCTTCT CCAAGGGGCC CAGCTCACGT GTATTCTATT 2280 

TTGCGTTTCT TACCTGCCTT CTTTATGGQA ACCCTCTTTT TAATTTATAT GTAT 2334 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUKNCE CHARACTERISTICS: 

(A) LENGTH: 586 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOIiECUIiE TYPE: protein 

(iii) HYPOTHETICAI,: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hfz5 protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Arg Pro Asp Pro Ser Ala Pro Pro Ser Leu Leu Leu Leu Leu 
15 10 15 

Leu Ala Gin Leu Val Gly Arg Ala Ala Ala Ala Ser Lys Ala Pro Val 
20 25 30 

Cys Gin Glu lie Thr Val Pro Met Cys Arg Gly lie Gly Tyr Asn Leu 
35 40 45 

Thr His Met Pro Asn Gin Phe Asn His Asp Thr Gin Asp Glu Ala Gly 
50 55 60 

Leu Glu Val His Gin Phe Trp Pro Leu Val Glu lie Gin Cys Ser Pro 
65 70 75 80 

Asp Leu Arg Phe Phe Leu Cys Thr Met Tyr Thr Pro lie Cys Leu Pro 
85 90 95 

Asp Tyr His Lys Pro Leu Pro Pro Cys Arg Ser Val Cys Glu Arg Ala 
100 105 110 

Lys Ala Gly Cys Ser Pro Leu Met Arg Gin Tyr Gly Phe Ala Trp Pro 
115 120 125 

Glu Arg Met Ser Cys Asp Arg Leu Pro Val Leu Gly Arg Asp Ala Glu 
130 135 140 

Val Leu Cys Met Asp Tyr Asn Arg Ser Glu Ala Thr Thr Ala Pro Pro 
145 150 155 160 

Arg Pro Phe Pro Ala Lys Pro Thr Leu Pro Gly Pro Pro Gly Ala Pro 
165 170 175 
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Ala Ser Gly Gly Qlu Cys Pro Ala Gly Gly Pro Phe Val Cys Lys Cys 
180 185 190 

Arg Glu Pro Phe Val Pro lie Leu Lys Glu Ser His Pro Leu Tyr Asn 
195 200 205 

Lys Val Arg Thr Gly Gin Val Pro Asn Cys Ala Val Pro Cys Tyr Gin 
2X0 215 220 

Pro Ser Phe Ser Ala Asp Glu Arg Thr Phe Ala Thr Phe Trp lie Gly 
225 230 235 240 

Leu Trp Ser Val Leu Cys Phe lie Ser Thr Ser Thr Thr Val Ala Thr 
245 250 255 

Phe Leu lie Asp Met Asp Thr Phe Arg Tyr Pro Glu Arg Pro He He 
260 265 270 

Phe Leu Ser Ala Cys Tyr Leu Cys Val Ser Leu Gly Phe Leu Val Arg 
275 280 285 

Leu Val Val Gly His Ala Ser Val Ala Cys Ser Arg Glu His Asn His 
290 295 300 

He His Tyr Glu Thr Thr Gly Pro Ala Leu Cys Thr He Val Phe Leu 
305 310 3X5 320 

Leu Val Tyr Phe Phe Gly Met Ala Ser Ser He Trp Trp Val He Leu 
325 330 335 

Ser Leu Thr Trp Phe Leu Ala Ala Ala Met Lys Trp Gly Asn Glu Ala 
340 345 350 

He Ala Gly Tyr Gly Gin Tyr Phe His Leu Ala Ala Trp Leu He Pro 
355 360 365 

Ser Val Lys Ser He Thr Ala Leu Ala Leu Ser Ser Val Asp Gly Asp 
370 375 380 

Pro Val Ala Gly He Cys Tyr Val Gly Asn Gin Asn Leu Asn Ser Leu 
385 390 395 400 

Arg Arg Phe Val Leu Gly Pro Leu Val Leu Tyr Leu Leu Val Gly Thr 
405 410 415 

Leu Phe Leu Leu Ala Gly Phe Val Ser Leu Phe Arg He Arg Ser Val 
420 425 430 

He Lys Gin Gly Gly Thr Lys Thr Asp Lys Leu Glu Lys Leu Met He 
435 440 445 

Arg He Gly He Phe Thr Leu Leu Tyr Thr Val Pro Ala Ser He Val 
450 455 460 

Val Ala Cys Tyr Leu Tyr Glu Gin His Tyr Arg Glu Ser Trp Glu Ala 
465 470 475 480 

Ala Leu Thr Cys Ala Cys Pro Gly His Asp Thr Gly Gin Pro Arg AXa 
485 490 495 

Lys Pro Glu Tyr Trp Val Leu Met Leu Lys Tyr Phe Met Cys Leu Val 
500 505 510 

Val Gly He Thr Ser Gly Val Trp He Trp Ser Gly Lys Thr Val Glu 
5X5 520 525 

Ser Trp Arg Arg Phe Thr Ser Arg Cys Cys Cys Arg Pro Arg Arg Gly 
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530 535 540 

His Lys Ser Gly Gly Ala Met Ala Ala Gly Asp Tyr Pro Glu Ala Ser 
545 550 555 560 

Ala Ala Leu Thr Gly Arg Thr Gly Pro Pro Gly Pro Ala Ala Thr Tyr 
565 570 575 

His Lys Gin Val Ser Leu Ser His Val Glx 
580 585 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mus musculus putative transmembrane 
receptor (frizzled 6) mRNA, Coding region: 146.. 2275 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



TCATTTCAGG 


CCCAGCTACT 


ATCAAAATGG 


TACAAAGAAT 


GCAATGAGGA 


ATTTGTACAT 


60 


TTTATCTCTG 


ATTTGAGAAT 


CTTTTTGATG 


CGGAAAGGAG 


CATAAGAATA 


ATCCAAGCCA 


120 


TGTGGTAAAA 


TCGGAGTCTG 


GCAAGATGGA 


AAGGTCCCCG 


TTTCTGTTG G 


CGTGCATTCT 


180 


TCTGCCCCTC 


GTAAGAGGAC 


ACAGCCTTTT 


CACCTGTGAG 


CCAATCACCG 


TTCCCAGATG 


240 


TATGAAAATG 


ACTTACAACA 


TGACGTTCTT 


CCCTAACCTG 


ATGGGTCATT 


ATGACCAGGG 


300 


GATCGCTGCT 


GTGGAAATGG 


GGCACTTTCT 


GCATCTTGCA 


AATCTAGAAT 


GTTCACCAAA 


360 


CATTGAAATG 


TTCCTTTGCC 


AAGCTTTTAT 


ACCAACCTGC 


ACAGAGCAAA 


TTCATGTAGT 


420 


TCTACCCTGT 


CGGAAATTGT 


GTGAGAAAAT 


AGTTTCTGAT 


TGCAAAAAAC 


TAATGGACAC 


480 


TTTTGGCATC 


CGATGGCCTG 


AAGAACTTGA 


ATGTAACAGA 


TTGCCACACT 


GTGATGACAC 


540 


TGTTCCTGTA 


ACTTCTCATC 


CACACACAGA 


GCTTTCTGGG 


CCACAGAAGA 


AATCAGATCA 


600 


AGTCCCAAGA 


GACATTGGAT 


TTTGGTGTCC 


AAAGCACCTT 


AGGACTTCCG 


GGGACCAAGG 


660 


CTATAGGTTT 


CTGGGAATTG 


AACAGTGTGC 


CCCTCCGTGC 


CCCAATATGT 


ATTTTAAAAG 


720 


TGATGAACTA 


GACTTTGCCA 


AAAGTTTCAT 


AGGAATAGTT 


TCAATATTTT 


GTCTTTGTG C 


780 


AACTCTGTTC 


ACGTTCCTTA 


CATTTTTAAT 


TGACGTTAGA 


CGATTCAGAT 


ACCCAGAGAG 


840 


ACCAATTATC 


TATTACTCTG 


TCTGCTACAG 


CATTGTCTCT 


CTCATGTACT 


TCGTGGGGTT 


900 


TTTGCTGGGC 


AATAGCACAG 


CTTGTAATAA 


GGCAGACGAG 


AAGCTGGAGC 


TCGGGGACAC 


960 


CGTTGTCCTA 


GGGTCAAAGA 


ATAAGGCTTG 


CAGTGTGGTA 


TTTATGTTTC 


TGTATTTTTT 


1020 


TACAATGGCT 


GGCACCGTGT 


GGTGGGTGAT 


TCTCACCATT 


ACGTGGTTCT 


TAGCTGCCGG 


1080 
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GA6AAAATGG 


AGTTGCGAAG 


CTAl IksAAUA 




Xljo 1 X v»\»/VX\A 


Uv-v»xT^GCCTG 


1140 


GGGGGCGCCC 


GGGTTCCTGA 


CCGTCATGCT 


GCTCGCTATG 


AA X AAVsVa X To 


AAGGAGACAA 


1200 


CATTAGCGGC 


GTTTG CTTCG 


TTGGCCTGTA 


TGACCTQaViAC 


varCv.. X V* i (-oL. 1 


ACTTCGTCCT 


1260 


TCTGCCTCTG 


TGCCTCTQCQ 


TATTTGTTGG 


OCTGTCTCTC 


CTCTTAG CCG 


GCATCATCTC 


1320 


CTTGAATCAT 


GTCCGACAAG 


TCATACAoCA 


X VsA X V9V3 1 


AA^ iwAAoAi? A 


TV^^f^TV TV TV^TV TV 

AvsUTAAAGAA 


1380 


ATTCATGATT 


CGCATCGGAG 


TCTTCAGTGG 


CCTGTATCTT 


(3Xu^UL.X lAu 


TGACACTTCT 


1440 


CGGTTGCTAT 


GTCTATGAGC 


TAGTGAACAG 


GATCACCTGG 


GAGATGACAT 


GGTTCTCTGA 


1500 


TCATTGTCAC 


CAGTACCGCA 


TCCCGTGCCC 


TTACCAGGCA 


AATCCAAAAG 


CTCGACCAGA 


1560 


ATTGGCTTTA 


TTTATGATAA 


AATATCTGAT 


GACATTAATT 


GTTGGTATCT 


CTGCGGTCTT 


1620 


CTGGGTTGGA 


AGCAAAAAGA 


CGTGCACAGA 


ATGGGCCGGG 


TTCTTTAAGC 


GAAACCGCAA 


1680 


GCGAGACCCC 


ATCAGTGAGA 


GCCGCCGAGT 


G CTGC AAG AG 


TCCTGTGAGT 


TCTTCCTGAA 


1740 


GCACAACTCT 


AAAGTGAAGC 


ACAAGAAGAA 


GCATGGCGCA 


CCAGGGCCTC 


ATAGGCTGAA 


1800 


GGTCATTTCC 


AAGTCCATGG 


GAACTAGCAC 


AGGAGCGACC 


ACAAATCATG 


GCACCTCTGC 


1860 


CATGGCAATC 


GCTGACCATG 


ATTACTTAGG 


GCAAGAAACT 


TCAACAGAAG 


TCCACACCTC 


1920 


CCCAGAAGCA 


TCCGTCAAAG 


AGGGACGAGC 


AGACCGAGCA 


AACACTCCCA 


GCGCCAAAGA 


1980 


TCGGGACTGT 


GGGGAATCTG 


CAGGGCCCAG 


TTCCAAGCTC 


TCTGGGAACC 


GGAACGGCAG 


2040 


GGAAAGCCGA 


GCGGGCGGCC 


TGAAGGAGAG 


AA6CAATGGA 


TCAGAGGGGG 


CTCCAAGTGA 


2100 


AGGAAGGGTA 


AGTCCAAAGA 


GCAGCGTTCC 


TGAGACTGGC 


CTGATAGACT 


GCAGCACTTC 


2160 


ACAGGCCGCC 


AGTTCTCCAG 


AACCAACCAG 


CCTCAAGGGC 


TCCACATCTC 


TGCCTGTTCA 


2220 


CTCTIGCTTCC 


AGAGCTAGGA 


AAGAGCAGGG 


TGCTGGCAGC 


CATTCCGACG 


CTT6AAGAAA 


2280 


ACTGTCTCGT 


TCCCCCAGAA 


GCACATGTAT 


GTTACACTGG 


9V ^ IV n^/^iv r^r^T^ iv 
AGATQACCAA 


CTGATTTGTC 


2340 


TTATAAAGGC 


CACTGTTGAG 


CTGGGAGAGT 


AGCCCAGTGG 


TACAGCGCCC 


ACCTGGAATA 


2400 


CTGAGGACCT 


GGGGTTGTCT 


CCCAGCACTG 


CAAAAGGAAA 


ATTCACTGTT 


ACAGTCTTCC 


2460 


TTGCACTTAA 


CCAGCTTTGT 


CTATGTTTTT 


TT 






2492 


(2) INFORMATION FOR SEQ ID NO: 12 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 710 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mf26 protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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Met Glu Arg Ser Pro Phe Leu Leu Ala Cys lie Leu Leu Pro Leu Val 
15 10 15 

Arg Gly His Ser Leu Phe Thr Cys Glu Pro lie Thr Val Pro Arg Cys 
20 25 30 

Met Lys Met Thr Tyr Asn Met Thr Phe Phe Pro Asn Leu Met Gly His 
35 40 45 

Tyr Asp Gin Gly lie Ala Ala Val Glu Met Gly His Phe Leu His Leu 
50 55 60 

Ala Asn Leu Glu Cys Ser Pro Asn lie Glu Met Phe Leu Cys Gin Ala 
65 70 75 80 

Phe He Pro Thr Cys Thr Glu Gin He His Val Val Leu Pro Cys Arg 
85 90 95 

Lys Leu Cys Glu Lys He Val Ser Asp Cys Lys Lys Leu Met Asp Thr 
100 105 110 

Phe Gly He Arg Trp Pro Glu Glu l-eu Glu Cys Asn Arg Leu Pro His 
115 120 125 

Cys Asp Asp Thr Val Pro Val Thr Ser His Pro His Thr Glu Leu Ser 
130 135 140 

Gly Pro Gin Lys Lys Ser Asp Gin Val Pro Arg Asp He Gly Phe Trp 
145 150 155 160 

Cys Pro Lys His Leu Arg Thr Ser Gly Asp Gin Gly Tyr Arg Phe Leu 
165 170 175 

Gly He Glu Gin Cys Ala Pro Pro Cys Pro Asn Met Tyr Phe Lys Ser 
180 185 190 

Asp Glu Leu Asp Phe Ala Lys Ser Phe He Gly He Val Ser He Phe 
195 200 205 

Cys Leu Cys Ala Thr Leu Phe Thr Phe Leu Thr Phe Leu He Asp Val 
210 215 220 

Arg Arg Phe Arg Tyr Pro Glu Arg Pro He He Tyr Tyr Ser Val Cys 
225 230 235 240 

Tyr Ser He Val Ser Leu Met Tyr Phe Val Gly Phe Leu Leu Gly Asn 
245 250 255 

Ser Thr Ala Cys Asn Lys Ala Asp Glu Lys Leu Glu Leu Gly Asp Thr 

260 265 270 

Val Val Leu Gly Ser Lys Asn Lys Ala Cys Ser Val Val Phe Met Phe 
275 280 285 

I*eu Tyr Phe Phe Thr Met Ala Gly Thr Val Trp Trp Val He Leu Thr 
290 295 300 

He Thr Trp Phe Leu Ala Ala Gly Arg Lys Trp Ser Cys Glu Ala He 
305 310 315 320 

Glu Gin Lys Ala Val Trp Phe His Ala Val Ala Trp Gly Ala Pro Gly 
325 330 335 

Phe Leu Thr Val Met Leu Leu Ala Met Asn Lys Val Glu Gly Asp Asn 
340 345 350 

He Ser Gly Val Cys Phe Val Gly Leu Tyr Asp Leu Asp Ala Ser Arg 
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355 



360 



365 



Tyr Phe Val Leu Leu Pro Leu Cys Leu Cys Val Phe Val Gly Leu Ser 
370 375 380 

Leu Leu Leu Ala Gly lie lie Ser Leu Asn His Val Arg Gin Val lie 
385 390 395 400 

Gin His Asp Gly Arg Asn Gin Glu Lys Leu Lys Lys Phe Met lie Arg 
405 410 415 

lie Gly Val Phe Ser Gly Leu Tyr Leu Val Pro Leu Val Thr Leu Leu 
420 425 430 

Gly Cys Tyr Val Tyr Glu Leu Val Asn Arg lie Thr Trp Glu Met Thr 
435 440 445 

Trp Phe Ser Asp His Cys His Gin Tyr Arg He Pro Cys Pro Tyr Gin 
450 455 460 

Ala Asn Pro Lys Ala Arg Pro Glu Leu Ala Leu Phe Met He Lys Tyr 
465 470 475 480 

Leu Met Thr Leu He Val Gly He Ser Ala Val Phe Trp Val Gly Ser 
485 490 495 

Lys Lys Thr Cys Thr Glu Trp Ala Gly Phe Phe Lys Arg Asn Arg Lys 
500 505 510 

Arg Asp Pro He Ser Glu Ser Arg Arg Val Leu Gin Glu Ser Cys Glu 
515 520 525 

Phe Phe Leu Lys His Asn Ser Lys Val Lys His Lys Lys Lys His Gly 
530 535 540 

Ala Pro Gly Pro His Arg Leu Lys Val He Ser Lys Ser Met Gly Thr 
545 550 555 560 

Ser Thr Gly Ala Thr Thr Asn His Gly Thr Ser Ala Met Ala He Ala 

565 570 575 

Asp His Asp Tyr Leu Gly Gin Glu Thr Ser Thr Glu Val His Thr Ser 
580 585 590 

Pro Glu Ala Ser Val Lys Glu Gly Arg Ala Asp Arg Ala Asn Thr Pro 
595 600 605 

Ser Ala Lys Asp Arg Asp Cys Gly Glu Ser Ala Gly Pro Ser Ser Lys 
610 615 620 

Leu Ser Gly Asn Arg Asn Gly Arg Glu Ser Arg Ala Gly Gly Leu Lys 
625 630 635 640 

Glu Arg Ser Asn Gly Ser Glu Gly Ala Pro Ser Glu Gly Arg Val Ser 
645 650 655 

Pro Lys Ser Ser Val Pro Glu Thr Gly Leu He Asp Cys Ser Thr Ser 
660 665 670 

Gin Ala Ala Ser Ser Pro Glu Pro Thr Ser Leu Lys Gly Ser Thr Ser 
675 680 685 

Leu Pro Val His Ser Ala Ser Arg Ala Arg Lys Glu Gin Gly Ala Gly 
690 695 700 

Ser His Ser Asp Ala Glx 
705 710 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2259 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotible 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 

(iii) HYPOTHETICAL: NO 

<iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mus musculus transmembrane receptor 
(frizzled 7> mRNA, Coding region: 362.. 2080 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TTTGAAG6TA 


ACCGGA6AA6 


CTTGTTGCTC 


GTCGCCGCAG 


AGAAAGCCGC 


ACCGTTACGT 


60 


CTCGGGGGGA 


GGGTAAGGCG 


ACACCCCTTC 


CCTCGTACCC 


CCACTCCAGG 


CCCAGGAGTT 


120 


TGAACTCCGG 


CGGCTGCGTG 


AGTGCCACGT 


GGAGGCGGCT 


GCGGCGCCCC 


TCGGCTGGCG 


180 


GCCTCGCCCC 


CGCTGTGCAG 


GCACCCTAGC 


ACCCTCGGCT 


CCGCGCCGCC 


CACGGCGGCC 


240 


CCGGCGCCGG 


GAGGACTCTC 


ATGCGCCGGC 


CGG6CG6CGG 


CGCCTCCCTG 


TATCCAAGCC 


300 


TCTCCCCAGC 


GCCTCGTCTT 


TTTCCTCCAG 


CTGAGAACGC 


CGCTGCACTC 


GCGACCGGCG 


360 


ATGCGGGGCC 


CCGGCACGGC 


GGCGTCGCAC 


TCGCCCCTGG 


GCCTCTGCGC 


CCTGGTGCTT 


420 


GCTCTTCTGG 


GCGCGCTGCC 


CACG6ACACC 


CGGGCTCAGC 


CATATCACGG 


CGAGAAAGGC 


480 


ATCTCGGTAC 


CGGACCACGG 


CTTCTGCCAG 


CCCATCTCCA 


TCCCGTTGTG 


CACGGATATC 


540 


GCCTACAACC 


AGACCATCCT 


GCCCAACCTG 


CTGGGCCACA 


CGAACCAAGA 


6GACGCGGGC 


600 


CTCGAGGTGC 


ACCAGTTCTA 


CCCTCTGGTA 


AAGGTGCAGT 


GTTCTCCTGA 


GCTACGCTTC 


660 


TTCTTATGCT 


CTAT6TACGC 


ACCCGTGTGC 


ACCGTGCTCG 


ACCAAGCCAT 


TCCTCCGTGC 


720 


CGTTCCTTGT 


GCGAGCGCGC 


CCGACAGGGC 


TGCGAGGCGC 


TCATGAACAA 


GTTCGGCTTC 


780 


CAGTGGCCAG 


AGCGGTTGCG 


CTGCGAOAAC 


TTCCCAGTGC 


ACGGTGCCGG 


CGAGATCTGC 


840 


GTGGGGCAGA 


ACACGTCCGA 


CGGCTCCGGG 


GGCGCGGGCG 


GCAGTCCCAC 


CGCCTACCCT 


900 


ACTGCTCCCT 


ACCTGCCAGA 


CCCACCTTTC 


ACTGCGATGT 


CCCCCTCAGA 


TGGCAGAGGC 


960 


CGCTTGTCTT 


TCCCCTTCTC 


GTGTCCGCGC 


CAGCTCAAGG 


TGCCCCCCTA 


CCTGGGCTAC 


1020 


CGCTTCCTAG 


GTGAGCGTGA 


CTGCGGTGCC 


CCGTGTGAGC 


CGGGCCGTGC 


TAACGGCCTC 


1080 


ATGTACTTTA 


AAGAAGAGGA 


GAGACGGTTC 


GCCCGCCTCT 


GGGTGGGTGT 


GTGGTCAGTG 


1140 


CTGTCGTGCG 


CCTCGACGCT 


CTTCACGGTG 


CTCACCTACC 


TAGTGGACAT 


GCGTCGCTTC 


1200 


AGCTATCCAG 


AGCGACCCAT 


CATCTTCCTG 


TCGGGTTGCT 


ACTTCATGGT 


GGCAGTGGCG 


1260 


CACGTGGCAG 


GCTTCCTGCT 


AGAGGACCGT 


GCCGTGTGCG 


TGGAGCGCTT 


CTCGGACGAT 


1320 


GGCTACCGCA 


CGGTGGCGCA 


GGGCACCAAG 


AAGGAGGGCT 


GCACCATCCT 


CTTCATGGTG 


1380 
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CTTTACTTCT 


TCGGTATGGC 


CAGCTCCATC 


TGGTGGGTCA 


TTCTGTCCCT 


CACTTGGTTC 


1440 


CTGGCAGCTG 


GCATGAAGTG 


GGGCCACGAG 


GCCATCGAGG 


CCAACTCGCA 


GTACTTTCAT 


1500 


CTGGCCGCGT 


GGGCTGTGCC 


AGCGGTCAAG 


ACAATCACCA 


TTTTGGCCAT 


GGGCCAGGTG 


1560 


GATGGTGACC 


TACTCAGTGG 


AGTGTGCTAC 


GTGGGCCTGT 


CTAGTGTGGA 


TGCATTGCX3G 


1620 


GGCTTCGTGC 


TGGCGCCCTT 


GTTCGTCTAC 


CTCTTCATCG 


GGACGTCCTT 


CCTGTTGGCC 


1680 


GGCTTTGTGT 


CTCTCTTTCG 


CATCCGCACC 


ATCATGAAGC 


ACGACGGCAC 


CAAGACAGAG 


1740 


AAGCTGGAGA 


AGCTGATGGT 


GCGCATCGGC 


GTCTTCAGCG 


TGCTCTACAC 


GGTGCCGGCC 


1800 


ACCATCGTGT 


TGGCCTGCTA 


CTTTTATGAG 


CAGGCCTTCC 


GAGAGCACTG 


GGAACGCACC 


1860 


TGGCTCCTGC 


AGACTTGCAA 


GAGCTACGCT 


GTGCCCTGCC 


CTCCGCGCCA 


CTTCTCTCCC 


1920 


ATGAGCCCCG 


ACTTTACAGT 


CTTCATGATC 


AAGTACCTGA 


TGACCATGAT 


CGTGGGCATC 


1980 


ACTACGGGCT 


TCTGGATCTG 


GTCGGGCAAG 


ACCCTGCAGT 


CATGGCGTCG 


CTTCTACCAC 


2040 


AQACTCAGCC 


ACAGCAGCAA 


GGGGGAAACT 


GCGGTATGAG 


CCCCGGTCCT 


TACCCACCCT 


2100 


TGCCTCTTCT 


ACCCTTTTAC 


AGGAGGAGAG 


GCATGGTAGG 


GAGAGAACTG 


CTGGGTGGGG 


2160 


GCTTGTTTCC 


GTAAGCTACC 


TGCCCCCTCC 


ACTGAGCTTT 


AACCTGGAAG 


TGAGAAGTTA 


2220 


TTTGGAQGTG 


AGAAGAGATT 


TGGGGGCGAG 


AGATGGTTT 






2259 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C> INDIVIDUAL ISOLATE: Mfz7 protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Arg Gly Pro Gly Thr Ala Ala Ser His Ser Pro Leu Gly Leu Cys 
15 10 15 

Ala Leu Val Leu Ala Leu Leu Gly Ala Leu Pro Thr Asp Thr Arg Ala 

20 25 30 

Gin Pro Tyr His Gly Glu Lys Gly lie Ser Val Pro Asp His Gly Phe 
35 40 45 

Cys Gin Pro lie Ser lie Pro Leu Cys Thr Asp lie Ala Tyr Asn Gin 
50 55 60 

Thr lie Leu Pro Asn Leu Leu Gly His Thr Asn Gin Glu Asp Ala Gly 
65 70 75 80 

Leu Glu Val His Gin Phe Tyr Pro Leu Val Lys Val Gin Cys Ser Pro 
85 90 95 
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Glu lieu Arg Phe Phe Leu Cys Ser Met Tyr Ala Pro Val Cys Thr Val 
100 105 110 

Leu Asp Gin Ala lie Pro Pro Cys Arg Ser Leu Cys Glu Arg Ala Arg 
115 120 125 

Gin Gly Cys Glu Ala Leu Met Asn Lys Phe Gly Phe Gin Trp Pro Glu 
130 135 140 

Arg Leu Arg Cys Glu Asn Phe Pro Val His Gly Ala Gly Glu lie Cys 
145 150 155 160 

Val Gly Gin Asn Thr Ser Asp Gly Ser Gly Gly Ala Gly Gly Ser Pro 
165 170 175 

Thr Ala Tyr Pro Thr Ala Pro Tyr Leu Pro Asp Pro Pro Phe Thr Ala 
180 185 190 

Met Ser Pro Ser Asp Gly Arg Gly Arg Leu Ser Phe Pro Phe Ser Cys 
195 200 205 

Pro Arg Gin Leu Lys Val Pro Pro Tyr Leu Gly Tyr Arg Phe Leu Gly 
210 215 220 

Glu Arg Asp Cys Gly Ala Pro Cys Glu Pro Gly Arg Ala Asn Gly Leu 
225 230 235 240 

Met Tyr Phe Lys Glu Glu Glu Arg Arg Phe Ala Arg Leu Trp Val Gly 

245 250 255 

Val Trp Ser Val Leu Ser Cys Ala Ser Thr Leu Phe Thr Val Leu Thr 
260 265 270 

Tyr Leu Val Asp Met Arg Arg Phe Ser Tyr Pro Glu Arg Pro lie lie 
275 280 285 

Phe Leu Ser Gly Cys Tyr Phe Met Val Ala Val Ala His Val Ala Gly 
290 295 300 

Phe Leu Leu Glu Asp Arg Ala Val Cys Val Glu Arg Phe Ser Asp Asp 
305 310 315 320 

Gly Tyr Arg Thr Val Ala Gin Gly Thr Lys Lys Glu Gly Cys Thr lie 
325 330 335 

Leu Phe Met Val Leu Tyr Phe Phe Gly Met Ala Ser Ser lie Trp Trp 
340 345 350 

Val lie Leu Ser Leu Thr Trp Phe Leu Ala Ala Gly Met Lys Trp Gly 
355 360 365 

His Glu Ala lie Glu Ala Asn Ser Gin Tyr Phe His Leu Ala Ala Trp 
370 375 380 

Ala Val Pro Ala Val Lys Thr lie Thr lie Leu Ala Met Gly Gin Val 
385 390 395 400 

Asp Gly Asp Leu Leu Ser Gly Val Cys Tyr Val Gly Leu Ser Ser Val 
405 410 415 

Asp Ala Leu Arg Gly Phe Val Leu Ala Pro Leu Phe Val Tyr Leu Phe 
420 425 430 

lie Gly Thr Ser Phe Leu Leu Ala Gly Phe Val Ser Leu Phe Arg lie 
435 440 445 

Arg Thr lie Met Lys His Asp Gly Thr Lys Thr Glu Lys Leu Glu Lys 
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450 455 

Leu Met Val Arg lie Gly Val Phe Ser 
465 470 

Thr lie Val Leu Ala Cys Tyr Phe Tyr 
485 

Trp Glu Arg Thr Trp Leu Leu Gin Thr 
500 505 

Cys Pro Pro Arg His Phe Ser Pro Met 
515 520 

Met lie Lys Tyr Leu Met Thr Met lie 
530 535 

Trp lie Trp Ser Gly Lys Thr Leu Gin 
545 550 

Arg Leu Ser His Ser Ser Lys Gly Glu 
565 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE; 

(C) INDIVIDUAL ISOLATE: Mus musculus transmembrane receptor 
(frizzled 8) gene. Coding region: 188.. 2245 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



GGGGGAGGGC 


CGGACGACTC 


CAGCCTAGGT 


TTCCAACCCT 


GCTGCCTGAA 


AAGGAGATAG 


60 


ACTGTTGCTA 


TTCTCCTCT6 


CAGAGAAAAG 


TGGGACACGA 


CCCGCTCTCC 


CTTTTCTCAG 


120 


ATTCCTCACT 


QCAGAGCCCT 


CCTGCGCGCC 


GCCTAGAGAA 


GGAGGACTTG 


GGGTCCCAGC 


180 


GCGCAGCATG 


GAGTGGGGTT 


ACCTGTTGGA 


AGTGACCTCG 


CTCCTAGCCG 


CCTTGGCGGT 


24 0 


GCTACAGCGC 


TCTAGCGGCG 


CTGCCGCGGC 


TTCGGCCAAG 


GAGCTGGCGT 


GCCAAGAGAT 


300 


CACGGTGCCG 


TTGTGCAAAG 


GCATCGGTTA 


CAACTACACT 


TACATGCCCA 


ACCAGTTCAA 


360 


CCACGACACG 


CAAGATGAGG 


CGGGCCTAGA 


GGTGCACCAG 


TTTTGGCCGC 


TGGTGGAGAT 


420 


ACAGTGCTCC 


CCGGACCTCA 


AGTTCTTTCT 


GTGTAGCATG 


TACACGCCCA 


TCTGCCTGGA 


480 


GGACTACAAG 


AAGCCTCTGC 


CGCCTTGTCG 


CTCTGTGTGT 


GAACGCGCCA 


AGGCCGGCTG 


540 


CGCGCCGCTC 


ATGCGCCAGT 


ACGGCTTTGC 


TTGGCCTGAC 


CGCATGCGCT 


GCGATCGGTT 


600 


GCCGGAGCAG 


GGCAACCCGG 


ACACTCTGTG 


CATGGACTAC 


AACCGCACCG 


ACCTCACCAC 


660 


GGCCGCGCCC 


AGCCCACCGC 


GCCGCCTGCC 


TCCGCCGCCT 


CCTCCCGGCG 


AGCAGCCGCC 


720 



460 

Val Leu Tyr Thr Val Pro Ala 
475 480 

Glu Gin Ala Phe Arg Glu His 
490 495 

Cys Lys Ser Tyr Ala Val Pro 
510 

Ser Pro Asp Phe Thr Val Phe 
525 

Val Gly lie Thr Thr Gly Phe 
540 

Ser Trp Arg Arg Phe Tyr His 
555 560 

Thr Ala Val Glx 
570 
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nnrr* r* n r*p 


V3 V— ' v. ^ O ^ ia^n-VTW 




CCACATPGTG 


GPGGPAGPAG 


/ 0 vy 


TAGGGGCAGC 






pppT*Tprsprsp 


ra^pnnnAAnn 


V> V9AOV3 Vk» V. W ^» 


OA A 








W X V«>Aw X OV» 


PG Pf3PGPPP A 


TGGTYS AGPGT 






\^\a\^ WA^ L. ^ijW 


X l» X AWfV^^ w\9 




GGPPAGATPG 


PPAAPTGTGP 


7011 




r" a o IV IV rr^^^^T 

V«A\.>AAIi»V»li.V. Ji 


X^ X X X.M\3\.^.A 




GPPTTPAPPG 


XWX XWXV9UAX 




CGGCfJTvixviVj 




ijfwX XlwvrXV»XV» 


WA^^ X X 


A^.. X V3 X Vv X ^ X A 


V»^X X^v^XVi^X 




CGATATGGAG 


CGCTTTAAGT 


IV r' ^/"v^/^ IV IV /V 
AUCwlivvAACAs 


I^Up^UAX uai a 


X XCwX^lu^la 


V.UTGTTAL.CT 


1140 


CTTCGTGTCT 


G^QGGGTACQ 


TGGTGCGCCT 


*^'*/^A^^ A 

GGTGGCAGGA 


^Anv* A^A A A 0 

catgagaaag 


TGG CCTGCAG 


1200 


CGGCGGC6CT 


CCvKtSTGuTIvi 








xGvvCGC 


1260 


AGGGGCAGCG 


GGACGGGGGG 


T\ ^ TV ^ ^^*»^ 


GGGCGCGCGC 


r^f^r^f^ A / A f^f^ 

GGLTGAGTAuG 


TV f^/^ A /*»/ 

AGGaGCTGGG 


1320 


CGCAGTTGAG 


CAGCATGTTC 


G CT ATG AGaC 


v*AwTu(j U w w L. 


GCGCTGTGCA 


CGGTGGTCTT 


1380 


TCTCCTTGTC 


TACTTTTTTG 


GCATGGCwaG 


CTCCATCTGG 


VTSf^ff^ ffi T^ TV f/^O 

X GGGTAATC C 


TGTCGCTCAC 


1440 


GTGGTTCTTG 


GCAGCTGGCA 


TGAAGTGGGG 


MHTv TV r^f^ TV ^r*^f^ 

TAACGAGGCC 


TV n*A^^A^^^«IH 

ATAGCAGGCT 


ACTCGCAGTA 


1500 


CTTCCACCTG 


GCCGCGTGGC 


TTGTGCCCAG 


CGTCAAGTCC 


ATCGCGGTGC 


TGGCGCTCAG 


1560 


CTCCGTAGAC 


GGCGACCCGG 


TGG CGGGCAT 


CTGCTACGTG 


^/^^A A/^^A#^A 

GG CAAC C AG A 


^/■^/■mfT^n*/** A ^ TV TV 

GCCTTGACAA 


162 0 


CCTACGCGGC 


TTTGTGCTGG 


CGCCACTGGT 


TATCTACCTC 


TTCATTGGGA 


CTATGTTTCT 


1680 


GTTAGCTGGC 


TTCGTGTCGC 


TGTTCCGAAT 


CCGTTCAGTC 


TV rTir~t TV TV /~» TV ^ 

ATC AAG CAG C 


TV TV /^/^ TV 

AAGGAGGTCC 


1740 


AACTAAGACA 


CACAAGCTAG 


AAAAACTCAT 


GATCCG CTTG 


GGCCTCTTCA 


CCGTGCTCTA 


1800 


CACGGTGCCC 


GCTGCCGTCG 


TTGTCGCCTG 


C CTTTTCTAT 


G AGC AG C AC A 


ACCGACCGCG 


1860 


CTGGGAGGCC 


ACGCACAACT 


GCCCATGCCT 


TCGGG AC CTG 


CAACCGGACC 


AGG CTCGCAG 


1920 


^^^^^ifc lining 
GCCCGATTAC 


GCGGTCTTCA 


TGCTt-AAGTA 


d X L. A xu X 


C 1 AVji 1 Aui X Go 


VsCATCACATC 


1980 


GGGCGTGTGG 


GTCTGGTlJCG 


^/-ITV A A /"•^/"^♦T* 

GCAACjALi X^L. X 


\9\9Al9 X ^V,. X V>k9 


CowuCCsX IGl 


GCACTAGGTG 


2040 


UTUC rGGVaCU 


TV /-I /-t TV TV r^f^r^r^r^ 


rwn/^ A /^m A 

U 1 oL. AVs 1 Av,9V7 




A r*r^r* a r*r^r^ 
oVaAVjvjrCAGCVar 


(3CQ.CTGGGGG 


2100 


CAGTGGACCv. 


GGGwCCGGwG 


GAQGTGuVrLjLv 


AwAv.(3i3CG\aA 


r^r^ r^r^n/^r* A T 
\jk3^(j\iO(3GAx 


t^r^f^tr** tin TV ^A^ 

C CCTCTACAG 


2160 




iv r*r*tifijr* r'Tva 2V 

A^k^vaVsv* L» X V7A 


WW X uuV^ova X i» 


TYSrtP A Pnrt OP 
X AwuVav. w 


AuImp X ^ X \9 X a X 


V. X X Awwv» xAA 


2220 


Va w AAA X 


X X V3 X 1»Im.\..A\9V7 


X ^ X VMW%W^\» X 


APnTnnATY2P 


PP AG A AGGGG 


przr: An 2v /^i? ZV r2 

V.'VjVaAssAururAur 




TGGGGGATGG 


GGAACCCGTG 


GGCGGCGAAG 


GGACCCCAGA 


CCGGCCAGGG 


TTCCCACCCC 


2340 


TTCCCAGTGT 


TGACTGCTAT 


AGCATGACAA 


TGAAGTGTTA 


ATGGTATCCA 


TTAGCAGCGG 


2400 


GGACTTAAAT 


6ACTCCCTTA 


G 








2421 


(2) INFORMATION FOR SEQ ID NO: 16: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mfz8 protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Glu Trp Gly Tyr Leu Leu Glu Val Thr Ser Leu lieu Ala Ala Leu 
15 10 15 

Ala Val Leu Gin Arg Ser Ser Gly Ala Ala Ala Ala Ser Ala Lys Glu 
20 25 30 

Leu Ala Cys Gin Glu lie Thr Val Pro Leu Cys Lys Gly lie Gly Tyr 
35 40 45 

Asn Tyr Thr Tyr Met Pro Asn Gin Phe Asn His Asp Thr Gin Asp Glu 
50 55 60 

Ala Gly Leu Glu Val His Gin Phe Trp Pro Leu Val Glu lie Gin Cys 
65 70 75 80 

Ser Pro Asp Leu Lys Phe Phe Leu Cys Ser Met Tyr Thr Pro lie Cys 
85 90 95 

Leu Glu Asp Tyr Lys Lys Pro Leu Pro Pro Cys Arg Ser Val Cys Glu 
100 105 110 

Arg Ala Lys Ala Gly Cys Ala Pro Leu Met Arg Gin Tyr Gly Phe Ala 

115 120 125 

Trp Pro Asp Arg Met Arg Cys Asp Arg Leu Pro Glu Gin Gly Asn Pro 
130 135 140 

Asp Thr Leu Cys Met Asp Tyr Asn Arg Thr Asp Leu Thr Thr Ala Ala 
145 150 155 160 

Pro Ser Pro Pro Arg Arg Leu Pro Pro Pro Pro Pro Pro Gly Glu Gin 
165 170 175 

Pro Pro Ser Gly Ser Gly His Ser Arg Pro Pro Gly Ala Arg Pro Pro 
180 185 190 

His Arg Gly Gly Ser Ser Arg Gly Ser Gly Asp Ala Ala Ala Ala Pro 
195 200 205 

Pro Ser Arg Gly Gly Lys Ala Arg Pro Pro Gly Gly Gly Ala Ala Pro 
210 215 220 

Cys Glu Pro Gly Cys Gin Cys Arg Ala Pro Met Val Ser Val Ser Ser 
225 230 235 240 

Glu Arg His Pro Leu Tyr Asn Arg Val Lys Thr Gly Gin lie Ala Asn 
245 250 255 

Cys Ala Leu Pro Cys His Asn Pro Phe Phe Ser Gin Asp Glu Arg Ala 
260 265 270 

Phe Thr Val Phe Trp He Gly Leu Trp Ser Val Leu Cys Phe Val Ser 
275 280 285 

Thr Phe Ala Thr Val Ser Thr Phe Leu He Asp Met Glu Arg Phe Lys 
290 295 300 

Tyr Pro Glu Arg Pro He He Phe Leu Ser Ala Cys Tyr Leu Phe Val 
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305 3X0 315 320 

Ser Val Gly Tyr Leu Val Arg Leu Val Ala Gly His Glu Lys Val Ala. 

325 330 335 

Cys Ser Gly Gly Ala Pro Gly Ala Gly Gly Arg Gly Gly Ala Gly Gly 
340 345 350 

Ala Ala Ala Ala Gly Ala Gly Ala Ala Gly Arg Gly Ala Ser Ser Pro 

355 360 365 

Gly Ala Arg Gly Glu Tyr Glu Glu Leu Gly Ala Val Glu Gin His Val 
370 375 380 

Arg Tyr Glu Thr Thr Gly Pro Ala Leu Cys Thr Val Val Phe Leu Leu 
385 390 395 400 

Val Tyr Phe Phe Gly Met Ala Ser Ser lie Trp Trp Val lie Leu Ser 
405 410 415 

Leu Thr Trp Phe Ijeu Ala Ala Gly Met Lys Trp Gly Asn Glu Ala lie 
420 425 430 

Ala Gly Tyr Ser Gin Tyr Phe His Leu Ala Ala Trp Leu Val Pro Ser 
435 440 445 

Val Lys Ser He Ala Val Leu Ala Leu Ser Ser Val Asp Gly Asp Pro 
450 455 460 

Val Ala Gly He Cys Tyr Val Gly Asn Gin Ser Leu Asp Asn Leu Arg 
465 470 475 480 

Gly Phe Val Leu Ala Pro Leu Val He Tyr Leu Phe He Gly Thr Met 
485 490 495 

Phe Leu Leu Ala Gly Phe Val Ser Leu Phe Arg He Arg Ser Val He 
500 505 510 

Lys Gin Gin Gly Gly Pro Thr Lys Thr His Lys Leu Glu Lys Leu Met 
515 520 525 

He Arg Leu Gly Leu Phe Thr Val Leu Tyr Thr Val Pro Ala Ala Val 
530 535 540 

Val Val Ala Cys Leu Phe Tyr Glu Gin His Asn Arg Pro Arg Trp Glu 
545 550 555 560 

Ala Thr His Asn Cys Pro Cys Leu Arg Asp Leu Gin Pro Asp Gin Ala 
565 570 575 

Arg Arg Pro Asp Tyr Ala Val Phe Met Leu Lys Tyr Phe Met Cys Leu 
580 585 590 

Val Val Gly He Thr Ser Gly Val Trp Val Trp Ser Gly Lys Thr Leu 
595 600 605 

Glu Ser Trp Arg Ala Leu Cys Thr Arg Cys Cys Trp Ala Ser Lys Gly 
610 615 620 

Ala Ala Val Gly Ala Gly Ala Gly Gly Ser Gly Pro Gly Gly Ser Gly 
625 630 635 640 

Pro Gly Pro Gly Gly Gly Gly Gly His Gly Gly Gly Gly Gly Ser Leu 
645 650 655 

Tyr Ser Asp Val Ser Thr Gly Leu Thr Trp Arg Ser Gly Thr Ala Ser 
660 665 670 
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Ser Val Ser Tyr Pro Lys Gin Met Pro Leu 
675 680 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOXiATE: Amino acid sequence used to design 
YW157 sense primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Tyr Pro Glu Arg Pro lie 
1 S 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STItANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Amino acid sequence used to design 
YW15B antisense primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Trp Phe Leu Ala Ala 
1 S 
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IT IS CLAIMED: 

1 . A method of identifying a compound capable of affecting binding of a Wnt 
polypeptide to a Wnt receptor (WntR) polypeptide, conq^rising 

5 contacting such a WntR polypeptide with a selected Wnt polypeptide, in the 

presence and absence of a test compound, 

measuring the effect of the test compound on the extent of binding between said 
Wnt and said WntR, and 

identifying said conq)ound as effective to alter binding of a Wnt polypq>tide to a 
10 WntR polypeptide if its measured effect on the extent of binding is above a threshold level. 

2. The method of claim 1, wherein said threshold is a 2-fold or greater inhibition 
of binding. 

15 3. The method of claim 1, wherein said threshold is a 2-fold or greater potentiation 

of binding. 

4. The method of claim 1, wherein said Wnt polypeptide is wingless (Wg). 

20 S. The method of claim 1, wherein said WntR polypeptide is Dfz2. 

6. The method of claim 5, wherein said WntR polypeptide has the amino acid 
sequence represented as SEQ ID NO:2. 

25 7. The method of claim 1, wherein said test compound is effective to inhibit 

binding between the Wnt polypeptide and the WntR polypeptide. 

8. The method of claim 1, wherein said test compound is effective to displace the 
Wnt polypeptide from the WntR polypeptide. 

30 

9, The method of claim 1, wherein said WntR polypeptide is expressed on the 
surface of a cell transformed with an expression vector encoding said receptor. 
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10. The method of claim 9, wherein said cell is a Drosophila Sneider 2 (S2) cell 
and said expression vector encodes the WntR polypeptide Dfz2. 

11. The method of claim 1, wherein said WntR polypeptide is an N-terminal 
portion of a full-length WntR polypeptide, said portion including the cysteine-rich 
amino-terminal domain. 

12. The method of claim 11, wherein said portion is a first part of a fusion protein. 

13. The method of claim 12, wherein said fusion protein further includes a second 
portion, said second portion containing the constant domain of human IgG. 

14. The method of claim 1, further comprising preparing a pharmaceutical 
preparation of a compound identified as effective to alter binding of a Wnt polypeptide to a 
WntR polypeptide. 
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Dtz2 MRHNRLKVLI LGLv/LLLTSC RADGPLHSAD HGMGGMGMGG HGLDASPAPG 50 

OCzi MWRQILFILP -TLIQGVQRY -DQSPLDASP YYRSGGGL-- ASSG 41 

Consensus M. ...L..L. ..L PL G.G G 50 

Dfz2 YGVPAIPKDP'NLRCEEITIP MCRGIGYNMT SFPNEMMHET QDEAGLEVHQ LOO 

Dfzl TELDGLPHHN --RCEPITIS ICKNIPYNMT IMPNLIGHTK QEEAGLEVHQ 89 

Consensus P RCE.ITI. .C..I.YNMT ..PN. ..H.. Q.EAGLEVHQ 100 

Dfz2 FWPLVEIKCS PDLKFFLCSM YTPICLEDYH KPLPVCRSVC ERARSGCAPI 150 

Dfzl FAPLVKIGCS DDLQLFLCSL YVPVC-TILE RPIPPCRSLC ESAR-VCEKL 137 

Consensus F.PLV.I.CS .DL..FLCS. Y.P.C P.P.CRS.C E.AR..C... 150 

Dfz2 MQQYSFEWPE RMACEHLPLH GDPDNLCMEQ PSYTEAGSGG SSGGSGGSGS 200 

Dfzl MKTYNFNWPE NLECSKFPVH GGED-LCVAE NTTS SASTAATPTR 180 

Consensus M. .Y.F.WPE ...C...P.H G. .D.LC S 200 

Dfz2 GSGSGGKRKQ GGSGSGGSGA GGSSGSTSTK PCRGRNSKNC QNPQGEKASG 250 

Dfzl SVAKVTTRKH QTGV ESPH — RNIG 202 

Consensus .......RK G p g 250 

Dfz2 KECSCSCRSP LIFLGKEQU:. QQQSQMPlylMH HPHHWYMNLT VQRIAGVPNC 300 

Dfzl FVC P V QL- — KTPLGMG' Y-ELK VG-GKDLHDC 229 

Consensus . .C P QL M Y. .L. V C 300 

Dfz2 GIPCKGPFFS NDEKDFAGLW lALWSGLCFC STLMTLTTFI IDTERFKYPE 350 

Dfzl GAPCHAMFFP ERERTVLRYW VGSWAAVCVA SCLFTVLTFL IDSSRFRYPE 279 

Consensus G.PC...FF. ..E W ...W...C.. S.L.T..TF. ID..RF. YPE 350 

Dfz2 RPIVFLSACY FMVAVGYLS- R N-FLQNEEIA CDGLL — LRE 387 

Dfzl RAIVFLAVCY LWGCAYVAG LGAGDSVSCR EPFPPPVKLG RLQMMSTITQ 329 

Consensus R.IVFL..CY ..V...Y R , ,F 400 

Dfz2 SSTGPHSCTL VFLLTYFF-G MASSIWWVIL SFTWFLAAGL KWGNEAITKH 436 

Dfzl GHRQTTSCTV LFM-ALYFCC MAAFAWWSCL AFAWFLAAGL KWGHEAIENK 378 

Consensus SCT. .F F. . MA. . .WW. -L .F.WFLAAGL KWG.EAI . . . 450 

Dfz2 SQYFHLAAWL IPTVQSVAVL LLSAVDGDPI LGICYVGNLN PDHLKTFVLA 486 

Dfzl SHLFHLVAWA VPALQTISVL ALAKVEGDIL SGVCFVGQLD THSLGAFLIL 428 

Consensus S..FHL.AW. .P. .Q. . .VL .L. -V.GD. . -G.C.VG.L. ...L.-F... 500 

Dfz2 PLFVYLVIGT TFLMAGFVSL FRIRSVIKQQ GGVGAGVKAD KLEKLMIRIG 53 6 

Dfzl PLCIYLSIGA LFLLAGFISL FRIRTVMKTD G KRTD KLERLMLRIG 473 

Consensus PL..YL.IG. .FL.AGF.SL FRIR.V.K. . G D KLE.LM.RIG 550 

Dfz2 IFSVLYTVPA TIVIGCYLYE AAYFEDWI— KALA CPCAQVKGPG 578 

Dfzl FFSGLFILPA VGLLGCLFYE YYNFDEWMIQ WHRDICKPFS IPCPAARAPG 523 

Consensus .FS.L-.-PA ....GC..YE . . .F. .W K. . . .PC PG 600 

Dfz2 K KPLYSV LMLKYFMALA VGITSGVWIW SGKTLESWRR FWRRLLGAPD 62 5 

Dfzl SPEARPIFQI FMVKYLCSML VGVTSSVWLY SSKTMVSWRN FVERLQGKEP 573 

Consensus P M.KY VG.TS.VW.. S.KT..SWR. F..RL.G... 650 

Dfz2 RTGANQALIK QRPPIPHPYA GSGMGMPVGS AAGSLLATPY TQAGGASVAS 67 5 

Dfzl RT -R AQAYV 581 

Consensus RT R Y 700 

Dfz2 TSHHHLHHHV LKQPAASHV 694 



Fig. 1 
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cells embryos 



larvae pupae 



S2 cl-8 0-2 2-3 4-5 9-12 1© 2© a^instar 




Dfz-2 (4.0 kb) 



RP49 (0.6 kb) 



Fig. 3 
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clone-8 



S2 



Dfz-2/S2 



__++--+ + Cu 

+ _4..4..4-— + wingless 



armadillo 



a-catenin 



Fig. 4 
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medium from: 

S2 HS-wg/S2 cells 



■i 
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Fig. 6 
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